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REGULATION OF GENE EXPRESSION IN TOBACCO FOR MANIPULATION OF 
PLANT GROWTH AND SECONDARY METABOLISM 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of US Patent Application Ser. No. 60/ 132,919, filed 
May 6, 1999, now abandoned, which is hereby incorporated by reference in its entirety herein. 

FIELD OF THE INVENTION 
This invention relates to enzymes involved in alkaloid, and specifically nicotine, formation in 
tobacco plants. The invention is based, at least in part, on the nucleotide sequences encoding four 
variants of putrescine N-methyltransferase (PMT1, PMT2, PMT3, and PMT4), two variants of 
arginine decarboxylase (ADC 1 and ADC2), ornithine decarboxylase (ODC), S-adenosylmethionine 
synthetase (SAMS), a fragment of NADH dehydrogenase, and a fragment of 
phosphoribosylanthranilate isomerase. The invention also relates to proteins expressed by these 
nucleotides, promoter regions of these nucleotides, use of these promoter regions to culture 
transgenic plant cells and to produce transgenic plants, sense and antisense nucleotides 
complementary to all or portions of these nucleotide sequences, use of sense and antisense 
nucleotides to regulate gene expression, and assays using proteins involved in alkaloid formation in 
tobacco plants. 

BACKGROUND OF THE INVENTION 

I. Alkaloid Formation 

Alkaloids are one of the most diverse groups of secondary compounds found in plants and 
they are the product of a complex biosynthesis pathway (Hashimoto and Yamada, 1994; Chou and 
Kutchan, 1998; Waterman, 1998). Why plants accumulate these compounds and in so many 
different forms is not known. Moreover, for many alkaloids, the exact site of synthesis and the 
factors that control their intercellular distribution and accumulation remain to be determined 
(Hashimoto and Yamada, 1994; Kutchan, 1995; Chou and Kutchan, 1998). 

Nicotine is the most abundant alkaloid present in cultivated tobacco. Nicotine is formed 
primarily in the roots of the tobacco plant and subsequently is transported to the leaves, where it is 
stored (Tso, Physiology and Biochemistry of Tobacco Plants, pp. 233-34, Dowden, Hutchinson & 
Ross, Stroudsburg, Pa. (1972)). 

The synthesis and accumulation of nicotine and other tobacco alkaloids are known to be 
controlled by various developmental, environmental, and chemical cues. Changes in phytohormone 
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(e.g., auxin, cytokinin) levels and/or ratios as a consequence of developmental age (Hashimoto and 
Yamada, 1994; Kutchan, 1995) or by direct manipulation of plant cell culture conditions have been 
shown to affect the synthesis and accumulation of nicotine and various tobacco alkaloids (Hashimoto 
and Yamada, 1994; Hibi et al. 3 1994; Eilbert, 1998). Various abiotic factors (wounding, drought 
5 stress, pH imbalance, etc.) [Hashimoto and Yamada, 1994; Kutchan, 1998; Waterman, 1998) 1, 2, 4], 
as well as biotic factors, such as herbivory, insect feeding, and attack by various microbial and fungal 
pathogens, are known elicit increased production of nicotine and other alkaloids in the leaves of wild 
and cultivated tobacco species (Baldwin, 1989; Saito and Murakoishi, 1998; Baldwin and Prestin, 
1999). In addition, the commercial practice of topping (i.e., removal of flowering head and young 

10 leaves at the upper portions of the plant), results in increases in nicotine and the amount and 

complexity total alkaloids present in the leaves of Nicotiana tabacum (Hashimoto and Yamada, 
1994; Hibi et aL, 1994). The factors controlling the topping-induced increase in alkaloid 
biosynthesis are not known, but likely involve a complex physiological response in the plant as a 
result of altered phytohormones and wound induced signaling (Akehurst, 1981; Hibi et aL, 1994; 

15 Kutchan, 1998). In this regard, considerable evidence now exists indicating that a jasmonic acid 
(J A)- mediated signal transduction pathway may play a role in regulation of gene expression 
contributing to this increase in alkaloid biosynthesis (Baldwin et aL, 1994, 1996,1997; Ohnmeiss et 
aL, 1997; Imanishi etaL, 1998a, 19986). 

The nicotine molecule is comprised of two heterocyclic rings, a pyridine moiety and a 

20 pyrrolidine moiety, each of which is derived from a separate biochemical pathway. The pyridine 
moiety of nicotine is derived from nicotinic acid. The pyrrolidine moiety of nicotine is provided 
through a pathway leading from putrescine to N-methylputrescine and then to N-methylpyrroline. 
(Goodwin and Mercer, Introduction to Plant Biochemistry, pp. 488-91, Pergamon Press, New York, 
(1983)). 

25 Putrescine is formed in plants by one of two pathways (Chattopadhyay and Ghosh, 1998). It 

can be synthesized directly from ornithine, in a reaction catalyzed by the enzyme ornithine 
decarboxylase (ODC, EC 4.1.1.17), or formed indirectly from arginine in a reaction sequence 
initiated by arginine decarboxylase (ADC, EC 4.1.1. 19). Putrescine formed by the ADC and/or ODC 
pathway serves as precursor in the synthesis of the higher polyamines, spermine and spermidine, 

30 catalyzed by the enzymes spermine synthase and spermidine synthase, respectively, or it is converted 
to N-methylputrescine by the action of putrescine N-methyltransferase (PMT), the first committed 
step in nicotine biosynthesis (Hashimoto and Yamada, 1994; Kutchan, 1995; Chattopadhyay and 
Ghosh, 1998). N-methyl putrescine is oxidized by a diamine oxidase and cyclized to form the 1- 
methyl-A^pyrrolium cation, which is condensed with nicotinic acid or its derivative to form nicotine 
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(Hashimoto and Yamada, 1994). 

Putrescene is a precursor for N-methylputrescine, which then forms N-methylpyrroline. 
Conversion of putrescine to N-methylputrescine is catalyzed by the enzyme putrescine 
N-methyltransferase ("PMT"), with S-adenosylmethionine serving as the methyl group donor. PMT 
appears to be the rate-limiting enzyme in the pathway supplying N-methylpyrroline for nicotine 
synthesis in tobacco (Feth et al., "Regulation in Tobacco Callus of Enzyme Activities of the Nicotine 
Pathway", Planta, 168, pp. 402-07 (1986); Wagner et al., "The Regulation of Enzyme Activities of 
the Nicotine Pathway in Tobacco", Physiol. Plant., 68, pp. 667-72 (1986)). 

II. TRANSGENIC PLANTS 

The methods of nicotine formation in tobacco and the genes involved have been studied both 
to better understand differential gene expression during tobacco growth and development, and also to 
discover tools useful for creating transgenic plants. For example, the regulatory sequences that 
modify protein expression in tobacco may be useful in creating transgenic tobacco or other 
transgenic plants. 

It has already been demonstrated that tissues of many plant species may be transformed by 
exogenous, typically chimeric, genes which are effective to stably transform cells of the tissues. For 
several species, tissues transformed in this fashion may be regenerated to give rise to whole 
transqenic or genetically engineered plants. The engineered traits introduced into the transgenic 
plants by these techniques have proven to be stable and have also proven to be transmissible through 
normal Mendellian inheritance to the progeny of the regenerated plants. One such desirable trait is 
the production in the plant cells of desired gene products in vivo in the cells of the transqenic plants. 
For a chimeric gene to be effective, the foreign DNA sequence containing a coding region should be 
flanked by appropriate promotion and control regions. Commonly used plant cell transcription 
promoters include the nopaline synthase promoter from the T-DNA of A. tumefaciens and the 35S 
promoter from the cauliflower mosaic virus. 

In order for the newly inserted chimeric gene to express the protein for which it codes in the 
plant cell, the proper regulatory signals must be present and in the proper location with respect to the 
gene. These regulatory signals include a promoter region, a 5' non-translated leader sequence and a 3' 
polyadenylation sequence. A promoter is a DNA sequence that directs the cellular machinery of a 
plant to produce RNA from the contiguous structural coding sequence downstream (3') to the 
promoter. The promoter region influences the rate at which the RNA product of the gene and 
resultant protein product of the gene is made. The 3' polyadenylation signal is a non-translated region 
that functions in 
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the plant cells to cause the addition of polyadenylate nucleotides to the 3* end of the RNA to enable 
the mRNA to be transported to the cytoplasm and to stabilize the mRNA for subsequent translation 
of the RNA to produce protein. 

Other plant cell transformation techniques are directed toward the direct insertion of DNA 
5 into the cytoplasm of plant cells from which it is taken up, by an uncharacterized mechanism, into 
the genome of the plant. One such technique is electroporation, in 

which electric shock causes disruption of the cellular membranes of individual plant cells. Plant 
protoplasts in aqueous solution when subject to electroporation will uptake DNA from the 
surrounding medium. Another technique involves the physical acceleration of DNA, coated onto 

10 small inert particles, either into reqenerable plant tissues or into plant germline cells. 

The availability of cloned nucleic acid sequences encoding an enzyme involved in alkaloid 
synthesis allows for the potential manipulation of alkaloid contents. Furthermore, the availability of 
promoters useful for expressing genes in plants allows for the creation of chimeric molecules and 
transgenic plants, which in turn result in possible native plant production of desirable proteins. 

15 Previously reported work discloses cloning nucleotides encoding proteins involved in the 

biosynthesis of nicotine, and isolating such proteins. Approximately twenty or more cDNAs and/or 
genomic DNA fragments encoding different enzymes involved with alkaloid formation have been 
isolated (Chattopadhyay and Ghosh, 1998). For example, successful cloning of partial or full-length 
cDNA encoding ODC activity from tobacco was disclosed by (Malik et al 9 J. Plant Biochem. 

20 &Biotech. 5: 109-1 12 (1996)). Also, a relatively crude preparation of PMT (30-fold purification) has 
been subjected to limited characterization (Mizusaki et al., "Phytochemical Studies on Tobacco 
Alkaloids XIV. The Occurrence and Properties of Putrescine N-Methyltransferase in Tobacco 
Plants", Plant Cell Physiol., 12, pp. 633-40 (1971)). A process for purifying PMT is disclosed in US 
Patent No. 5,369,023, "Method of purifying putrescine n-methyltransferase from tobacco plant 

25 extract with an anion exchange medium", hereby incorporated by reference in its entirety herein. 

Several laboratories have reported the cloning of partial or full-length cDNAs encoding ADC (Bell 
and Malmberg ,1990; Rostogi et al., 1993; Perez-Amador et al., 1995; Nam et al., 1997; Watson and 
Malmberg, 1996). Comparisons of the amino acid sequences of ADC from various plants revealed a 
high degree of conservation among the various proteins, as well as homology to ODC (Malmberg et 

30 al., 1998). 

It is an object of the present invention to characterize the nucleotide and amino acid 
sequences of enzymes involved in the biosynthesis of nicotine in tobacco. It is also an object of the 
present invention to provide plant promoter regions that are capable of conferring high levels of 
transcription in rapidly dividing cells of transformed plants when coupled with a heterologous coding 
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sequence in a chimeric gene. Further, the invention is directed to chimeric genes incorporating such 
promoter regions, stable transfection of plants with these chimeric genes, and the plants and cells that 
are transfected, as well as seeds of such transfected plants. It is a further object to characterize sense 
and antisense nucleotides capable of regulating expression of genes encoding for enzymes involved 
5 in the biosynthesis of alkaloids. 

SUMMARY OF THE INVENTION 
Proteins involved in the biosynthesis of nicotine in tobacco N tabacum are the subject of this 
invention. More specifically, the invention concerns four variants of putrescine N-methyltransferase 
1 0 (PMT1 , PMT2, PMT3, and PMT4), two variants of arginine decarboxylase (ADC 1 and ADC2), 

ornithine decarboxylase (ODC), S-adenosylmethionine synthetase (SAMS), NADH dehydrogenase, 
and phosphoribosylanthranilate isomerase. 

BRIEF DESCRIPTION OF THE FIGURES 

15 Figure 1. Genomic DNA gel blot analysis of the PMT gene family in N tabacum cv. Xanthi. 

Total genomic DNA (30 |ig) was digested with Kpnl, EcoRI, or EcoRl and Kpnl, separated by 
agarose gel electrophoresis, and transferred to nylon membranes. The membrane was hybridized 
with a 32p_i a beled antisense strand probe covering the complete coding region of the NtPMTla 
cDNA. Identity of the hybridizing bands as determined by comparison to phage DNA digests is 

20 indicated. Molecular weights are given in kb. Note that Kpnl shifts only the NtPMTlb band in the 
gel blot because this restriction site is present ony in Exon 1 of NtPMTlb and not NtPMTla. 

Figure 2. Amino acid sequence alignment of N tabacum PMTs. Shown is a PILEUP alignment of 
the predicted amino acid sequences of the various tobacco PMTs. Amino acid residues that differing 

25 among the PMTs are shaded. NtPMTla, NtPMT2, NtPMT3, and NtPMT4 refer to the deduced 
amino acid sequences of the PMTs encoded by the NtPMTla, NtPMT2, NtPMT3, and NtPMT4 
genes, respectively, isolated from N tabacum cv. Xanthi genomic DNA; cNtPMTla is the predicted 
amino acid sequence of the A41 1 cDNA (Accession No. D28506) isolated from N tabacum cv. 
Burley 21 by Hibi et al. (1994). The location of the exon-intron boundaries are indicated by the dark 

30 vertical line. The nucleotide sequences for NtPMTla, NtPMT2, NtPMT3, and NtPMT4 appear in 
GenBank under the accession numbers AF126810, AF126809, AF12681 1, and AF126812, 
respectively 

Figure 3. Polyacrylamide gel electrophoresis analysis of PCR amplified genomic DNA fragments 
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encoding Exon 1 of PMT from various species of Nicotiana. PCR amplification was carried out as 
described in the Materials and Methods using Exon 1 -specific primers 1 and 2 and total genomic 
DNA isolated from N. tabacum, N. otophora, and TV. tomentosiformis. The amplification products 
were separated by electrophoresis on 6.5% polyacrylamide gels, the gels fixed, and subject to 
5 autoradiography. The amplification products isolated from N. tabacum cv. Burley 21 and N. tabacum 
cv. Xanthi were identical and only the amplication products from the reactions with N. tabacum cv. 
Burley 21 DNA are shown. Standards were generated in identical reaction conditions primed with 
plasmid DNA encoding the various PMT genes isolated in this study. 

10 

Figure 4. Nucleotide sequence alignment of the 5'-flanking regions of the N. tabacum PMT genes. 
Shown is a PILEUP alignment of the nucleotide sequences upstream of the initiating methionine 
(MET) codon of the four PMT genes isolated from TV. tabacum cv. Xanthi. The proposed start site 
for transcription of the NtPMTla gene is indicated by the +1 above the sequences. The TATA-box 

15 and CCAAT-box motifs are boxed. Potential transcriptional regulatory elements identified by 
MOTIF search programs are also boxed and indicated by the following abbreviations:. PAL: 
palindromic sequences; G-Box: G-Box homologous sequences; MRE: metal-responsive 
element homolog. Nucleotides identical in three or more sequences are shaded. The polyguanine- 
rich region is underlined. Numbering is indicated to the right and is relative to the proposed start site 

20 of each gene. 

Figure 5. RNA gel blot analysis of PMT* transcript levels in various tissues Total RNA was isolated 
from various tissues of mature N. tabacum cv. Burley 21 and analyzed by gel blot analysis using a 
32 P-labeled NtPMTla cDNA coding region (Exons 2 to 8) probe capable of detecting all PMT 
transcripts. 

25 A. PMT transcript levels in various tobacco plant tissues and/or organs. 

B. Induction of PMT expression in tobacco roots following topping. Abbreviations: HP, wild-type 
(NiclNic2) Burley 21; LP, low alkaloid (niclnicl) mutant. The (3-subunit of mitochondrial 
ATPase ((3-ATPase) served as a control. 

30 Figure 6. Semi-quantitative RT-PCR analysis of PMT gene expression in roots of tobacco plant 
before and after topping. 

A. Shown is relative abundance of the individual PMT gene transcripts before and after topping. RT- 
PCR was carried out as described in the Material and methods using Exon 1 specific primers. 
Messenger RNA was amplified from total RNA isolated from the roots of wild-type (HP, 
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NiclNic2) Burley 21 and low alkaloid (LP, niclnicl) Burley 21 tobacco plants. Far right lane 
represents size standards for the genes isolated by PCR amplification from plasmid DNA. The P- 
subunit of mitochondrial ATPase (P-ATPase) served as a control. 

5 B. Bar graphs showing relative expression of the individual PMT genes following topping in both HP 
and LP tobacco roots. Abbreviations: HP, wild-type (NicJNic2) Burley 21; LP, low alkaloid 
{niclnicl) mutant. 

Figure 7. The nucleotide and predicted amino acid sequences of the transcribed portions of the N. 

10 tabacum cv Xanthi NtADCl and NtADC2 genes. Shown are the complete nucleotide and predicted 
amino acid sequence of the N. tabacum cv Xanthi NtADCl gene and where it differs from the 
NtADC2 gene sequence. The dots indicate nucleotide sequence identity and the stars indicate amino 
acid sequence identity. The proposed polyadenylation signal is underlined. The sequences terminate 
at the point of polyadenylation found in the full length cDNA (Wang, 1999; AF127239).The 

15 complete nucleotide sequences for the N. tabacum cv Xanthi NtADCl (AF 127240) and NtADC2 
(AF127241) including the 5' and 3 f flanking sequences appear in Genbank. 

Fig. 8. Comparison of the predicted amino acid sequences of arginine decarboxylases (ADCs) from 
various species. Shown is a PILEUP alignment of the predicted amino acid sequence of the N. 
20 tabacum cv Xanthi NtADCl gene (AF 127240) aligned to the predicted ADC protein sequences from 
N. sylvestris (AB 12873), Arabidopsis thaliana (AF009647), Avena sativa (oat) (X56802), 
Lycopersicon esculentum (tomato) (LI 65 82) and Escherichia coli (M31770). Amino acid residues 
conserved among the various ADC are shaded. 

25 Fig. 9. Gel blot analysis of ADC transcript levels in the roots of wild-type and low alkaloid mutant 
Burley 21 tobacco before and after topping. Total RNA was isolated from the roots of mature wild- 
type and low alkaloid mutant N. tabacum cv. Burley 21 and analyzed by gel blot analysis using [a- 
32 P]-dCTP labeled probes recognizing the coding region of ADC or the p-subunit of tobacco 
mitochondrial ATP synthase (Boutry and Chua, 1985). Quantitation was carried out by 

30 phosphorimaging using a Molecular Dynamics Phosphorlmager. Values were normalized relative to 
the intensities of the atp2 control band in each lane. The experiment was conducted twice with 
different total RNA samples. 
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Fig. 10. Differential expression of NtADC-1 and NtADC-2 in various tissues of tobacco. Expression 
of the NtADC-1 and NtADC-2 genes was analyzed using semi-quantitative RT-PCR and gene 
specific primers capable of discriminating between transcripts arising from the two genes. Panel A 
shows control reactions demonstrating primer specificity in the PCR reactions using plasmids 
5 containing the NtADC-1 and NtADC-2 coding sequences. The numbers above the lane refer to the 
specific primer combinations as described in the Materia and methods. Panel B shows the results of 
RT-PCR reactions using first strand cDNA synthesized from total RNA extracted from either root, 
leaf, or flowers. As a internal control, primers specific for the atp2 gene transcript were include in the 
amplification reactions. All reactions were carried out within the linear range of template 
10 amplification as determined by varying template amount, amplification time, and temperature as 
described in Riechers and Timko (1999). 

Fig. 11. Genomic DNA gel blot analysis of the ODC gene family in N. tabacum. Total genomic 
DNA (30 /ug) was digested with EcoRI or HindUI, fractionated by agarose gel electrophoresis, 
15 transferred to nylon membranes and hybridized with an a- 32 P-dCTP labeled probe encoding full- 
length ODC cDNA as described in the Materials. The mobility of molecular weights standards are 
given to the right of the figure in kilobases (kb). 

Fig 12. Comparison of the nucleotide and predicted amino acid sequences of the NtODC-1 and 
20 NtODC-2 genes. Shown are the nucleotide and predicted amino acid sequences of the NtODC-1 

(AF233850) and NtODC-2 (AF233849) genes. In the figure, the complete amino acid sequence of 
the pODC2 is given and the pODCl sequence is given only where it differs. The start site of 
transcription is designated as +1 and the poly(A) addition site is indicated by the arrow. Within the 
relevant regions of homology, nucleotide differences between the NtODC-1 and NtODC-2 genes are 
25 in bold lettering. The proposed TATA-box, and polyadenylation signal are shaded. 

Fig. 13. Protein sequences alignment of ornithine decarboxylases (ODCs) from various species. 
Shown is a PILEUP alignment of the predicted amino acid sequences of the N. tabacum cv. Xanthi 
pODC2 protein (AF233849) with the ODCs from N. tabacum cv. SC58 (Y10472) and cv. BY-2 
30 (ABO31066), Lycopersicon esculentum (tomato) (AF030292), Datura stramonium (jimsonweed) 
(X87847), Saccharomyces cerevisiae (NP_012737), and humans (Homo sapiens', AAA59966). 
Amino acid residues conserved among the various ODCs are shaded. 
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Fig. 14. Gel blot analysis of ODC transcript levels in various tissues of mature tobacco plants and 
in the roots before and after topping. Total RNA was isolated from various tissues of mature N. 
tabacum cv. Burley 21 and analyzed by gel blot analysis using an a- 32 P-dCTP labeled coding region 
probes for ODC. (A) Transcript levels in various organs of wild-type tobacco: R, root: S, stem ; L, 
leaf; SE, sepal; PE, petal; O, ovary; S, stamen; and AN, anther. (B) Transcript levels in roots of 
Burley 21 tobacco plants before and after topping. RNA gel blot analysis of the tissues-specific 
distribution and post-topping expression of transcripts encoding ODC in tobacco. As a control, the 
blots were also probed with radioactively labeled probes encoding the alkaloid biosynthesis enzyme 
putrescine N-methyltransferase (PMT) and a root specific P-glucosidase (TBG-1). 

DETAILED DESCRIPTION OF THE INVENTION 
Nucleic acid sequences have been isolated from tobacco that encode important enzymes in 
nicotine and total alkaloid formation, including PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, 
and SAMS. Also identified are cDNA fragments encoding partial segments of NADH 
dehydrogenase and phosphoribosilanthronilate isomerase. Also identified are promoter regions for 
the nucleotides encoding PMT1, PMT2, PMT3, PMT4, and ADC2. All of these nucleic acids are 
isolated from Nicotiana tabacum L. 

"Promoter" and "promoter region" are terms used interchangeably herein to refer to a DNA 
sequence that regulates expression of a selected DNA sequence operably linked to the promoter, and 
which effects expression of the selected DNA sequence in cells. The term also encompasses the 
5'untranslated region that may be transcribed into mRNA but is not translated. 

"Protein", "polypeptide", and "peptide" are used interchangeably herein when referring to a 
gene product. 

In one aspect, the invention features isolated nucleic acid molecules encoding for PMT1, 
PMT2, PMT3, PMT4, ADC1, ADC2, ODC, and SAMS, a fragment of NADH dehydrogenase and a 
fragment of phosphoribosilanthronilate isomerase. The disclosed molecules can be non-coding (e.g. 
probe, antisense or ribozyme molecules) or can code for a functional enzyme. In one embodiment, 
the nucleic acid molecules can hybridize to the nucleic acid sequences encoding for PMT1, PMT2, 
PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or a fragment of 
phosphoribosilanthronilate isomerase or to the complements of these nucleic acid sequences. In a 
preferred embodiment, the hybridization is conducted under mildly stringent or stringent conditions. 

In further embodiments, the nucleic acid molecule is at least 50%, 60%, 70%, 80% and more 
preferably at least 90% or 95% homologous in sequence to the nucleic acid sequences encoding for 
PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or 
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a fragment of phosphoribosilanthronilate isomerase or to the complements of these nucleic acid 
sequences. In another embodiment, the nucleic acid encodes a polypeptide that is at least 50%, 60%, 
70%, 80% and more preferably at least 90% or 95% similar in sequence to the amino acid sequence 
of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, the fragment disclosed herein of 
5 NADH dehydrogenase, or the fragment of phosphoribosilanthronilate isomerase disclosed herein. 

In another embodiment, the invention features isolated polypeptides, preferably substantially 
pure preparations, encoded for by the nucleic acid sequences of the invention. Particularly preferred 
are those polypeptides encoded for by the nucleic acid sequences identified by (SEQ. ID. NO. 2), 
(SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 1 1), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15), 

10 (SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26) 
or comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ ID NO. 3), 
(SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), 
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24). In particularly preferred 
embodiments, the subject polypeptides can aid in regulating the production of alkaloids, particularly 

15 nicotine, in plants. In one embodiment, the polypeptide is identical to or similar to the protein 

represented by the amino acid sequences of (SEQ ID NO. 3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), 
(SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22) 
or (SEQ. ID. NO. 24). In a preferred embodiment, the polypeptide is encoded by a nucleic acid that 
hybridizes with a nucleic acid represented in. 

20 The polypeptides of the present invention can comprise full length proteins, such as represented 

by (SEQ ID NO. 3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 1 2), (SEQ. ID. NO. 14), 
(SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22) and (SEQ. ID. NO. 24) , or can comprise 
one or more fragments corresponding to one or more particular motifs/domains, or to arbitrary sizes, 
e.g., at least 5, 10, 25, 50, 100, 150, or 200 amino acids in length. 

25 Another aspect of the invention features chimeric genes comprised of a promoter for the genes 

for PMT2, PMT1, PMT3, PMT4, or ADC2. Yet another aspect of the invention features chimeric 
genes or chimeric molecules comprised respectively of the functional gene encoding for or the 
protein PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH dehydrogenase and/or 
phosphoribosilanthronilate isomerase. 

30 The invention also concerns isolated and purified promoter regions for tobacco Beta- 

glucosidase and their use in chimeric genes or chimeric molecules. 

Another aspect of the invention involves vectors capable of transporting another nucleic acid to 
which a vector has been linked. Preferably, the vectors comprise the nucleic acid sequences of the 
invention or their complements. 
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The invention also features transgenic plants and their seeds that include (and preferably 
express) a heterologous form of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH 
dehydrogenase and/or phosphoribosilanthronilate isomerase. The present invention also 
encompasses transgenic plants that contain in their genome a chimeric gene construction 
5 incorporating the nucleic acid encoding PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, 
NADH dehydrogenase and/or phosphoribosilanthronilate isomerase. Such transgenic plants and their 
seeds may be useful to natively produce enhanced quantities of desirable exogenous proteins, such as 
compounds useful for pharmaceutical purposes, or proteins that provide herbicide resistance. 

Another feature of the invention is the use as probes of the DNA sequences disclosed herein or 
10 oligonucleotide fragments thereof. Probes may be useful to obtain additional gene family members 
or locate homologous genes in tobacco or other plant species. Copies of related genes can be 
obtained from existing genomic libraries or the genomic libraries can be constructed. In one 
embodiment, an isolated DNA sequence comprising about a fifteen to about a twenty-five base pair 
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair 
15 sequence found in the sequences of the invention is used as a probe. 

Another feature is use of the polypeptides of the invention in an assay, such as an assay to 
identify modulators of enzyme activity in plants. 

Other features and advantages of the invention will be apparent to those of skill in the art. 
The nucleotide and amino acid sequences of the invention are disclosed herein in the Sequence 
20 Listing, text, and the figures. Specific sequences of the invention are provided in the attached 
Sequence Listing and can be understood to represent promoters, nucleic acids, and proteins 
respectively relating to the following proteins: PMT2 (SEQ. ID. NOS. 1, 2, and 3); PMT1 (SEQ. ID. 
NOS. 4, 5, and 6); PMT3 (SEQ. ID. NOS. 7, 8, and 9); PMT4 (SEQ. ID. NOS. 10, 1 1, and 12); 
SAMS (SEQ. ID. NOS. 13 and 14 ); ODC (SEQ. ID. NOS. 15 and 16); ADC1 (SEQ. ID. NOS. 17, 
25 18, and 19); ADC2 (SEQ. ID. NOS. 20, 21, and 22); ADC1 mRNA (SEQ. ID. NOS. 23 and 24); 
NADH dehydrogenase (SEQ. ID. NO. 25); and PAI (SEQ. ID. NO. 26). If only two sequence 
identifiers are provided for a protein, those sequences represent the nucleic acid and the protein 
respectively. If three identifiers are provided, the identifiers represent promoter, genomic or cDNA 
nucleic acid, and peptide sequences, respectively. If only one identifier is provided, it represents a 
30 DNA fragment coding for the protein or portions of it. 

For other reference, the sequences may be found at the following records in GenBank at the 
following Accession Numbers, which records are hereby incorporated in their entirety herein: 
AF126810 (NtPMTl); AF126809 (NtPMT2); AF126811 (NtPMT3); AF126812 (NtPMT4), 
AF1 76908 (NtomPMT)(Nicotiana tomentosiformis); AF76909 (NotoPMT)(Nicotiana otophora); 
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AF127239 (ADC); AF127240 (ADC1); AF127241 (ADC2); AF127242 (ODC); AF233849 (ODC2); 
AF233850 (ODC1); and AF127243 (SAMS). 

The following experimental discussion is presented to better illustrate the invention. 
I. PMT 

5 The present invention features the characterization of four members of the nuclear gene family 

encoding PMT in tobacco N. tabacum. The nucleic acid sequences encoding PMT and the amino 
acid sequences for the proteins are reported herein and can also be found in the DDB J, EMBL, and 
GenBank Nucleotide Sequence Databases under the accession numbers for NtPMTla, NtPMT2, 
NtPMT3, and NtPMT4 at AF126810, AF126809, AF12681 1, and AF126812, respectively. The 
10 complete coding region and immediate 5'- and 3'- flanking regions are characterized. 

As the discussion below shows, all four PMT genes present in the N. tabacum genome are 
expressed in the roots of wild-type plants and differentially regulated in tobacco lines expressing 
either high or low total alkaloid contents. 

15 Materials and Methods 

Plant materials 

Seeds of N. sylvestris, N. otophora, and N. tomentosiformis were obtained from the USDA-ARS 
20 national tobacco germplasm collection (Oxford, NC). N. tabacum cv. Burley 21 and N. tabacum cv. 
Xanthi seeds were kindly provided by Glenn Collins, University of Kentucky. Tobacco plants used 
for DNA isolation were grown in a soihvermiculite mixture in the greenhouse under natural lighting 
conditions. Plants used for RNA extraction were grown in Moltan Plus (Moltan Co., Middleton, 
TN). 

25 

Gel blot analysis of genomic DNA 

Young leaves were collected from greenhouse grown tobacco (N, tabacum cv. Xanthi) plants and 
total genomic DNA was isolated from freshly-harvested tissues using a modification of the CTAB 
30 extraction method (Dellaporta et al., 1983). Approximately 30 jig of total DNA was digested with 
EcoKL, Kpril, or EcoKi and Kpnl and the digestion products separated by electrophoresis through a 
0.75% agarose gel. Restricted and size-fractionated DNA was denatured and transferred to Nytran+ 
nylon membranes (Schleicher and Schuell, Keene, NH) by capillary blotting in 0.4N NaOH 
overnight. Membranes were prehybridized in 0.25M NajHPC^ (pH 7.4), 7% SDS, 1 mM Na^DTA 
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for at least 2 hr, then hybridized overnight at 65 °C in the same buffer with 2-3 x 10 6 cpm/mL of a 
32 P-labeled single-stranded probe (antisense DNA strand). The probe was prepared by the method of 
Bednarczuk et al. (1991) using a primer (Table 1, primer 4) designed from the 3' end of the 
NtPMTla coding region (Exon 8) and the full-length coding region of the NtPMTla cDNA as 
5 template. The NtPMTla cDNA was generated by RT-PCR using synthetic oligonucleotide primers 
based on the N- and C-terminal sequences of the A41 1 cDNA reported by Hibi et ah (1994) and 
RNA template isolated from N tabacum cv. Burley 21 roots. Membranes were washed at a final 
stringency of 0.1 x SSC, 0.1% SDS at 65 °C. Hybridizing bands were visualized by autoradiography 
and/or imaged using a Molecular Dynamics Phosphorlmager (Model 445 SI, Sunnyvale, CA). 

10 

Genomic library construction and phage isolation 

A library of N tabacum cv. Xanthi genomic DNA fragments constructed in EMBL3 was purchased 
from Clontech (Palo Alto, CA) and a total of 1 . 1 x 10 6 recombinant phage were screened by plaque 

15 hybridization using random-primed 32 P-labeled NtPMTla cDNA as probe (Sambrook et aL, 1989). 
Prehybridization, hybridization, and washing conditions were as described above. Positive 
hybridizing phage were plaque purified by subsequent rounds of rescreening and DNA was prepared 
from 18 independently isolated phage. The phage DNA was characterized by restriction analysis and 
DNA gel blot analysis and fragments containing the sequences encoding PMT were subcloned into 

20 pBluescript KS vectors for further analysis. 

Comparison of the hybridizing fragments present in the 18 recombinant phage to the 
hybridization pattern obtained by genomic DNA blot analysis indicated that only three of the PMT 
genes suspected to be present in the N tabacum genome were recovered from the library screen. To 
obtain sequences encoding NtPMTla, a subgenomic library was constructed from N tabacum cv. 

25 Xanthi DNA. The library consisted of gel-purified 2.5-3.5 kb EcoRl fragments ligated into A_ZAP 
II vector arms and packaged using Gigapack III Gold packaging extracts according to the 
manufacturer's instructions (Stratagene, La Jolla, CA). The primary library was amplified once in E. 
coli XL 1 -Blue MRF' strain (Stratagene) and screened as described above, except that a random- 
primed 32 P-labeled NtPMTla cDNA Exon 1 -specific probe was used (Table 1). Exon 1 had 

30 previously been amplified by PCR using primers 1 and 2 (Table 1) and the NtPMTla cDNA as 

template. The recombinant phage that hybridized with the probe was isolated from the sublibrary by 
two more rounds of plaque purification, and the pBluescript phagemid containing the approximate 
3.1 kb EcolU genomic fragment with the NtPMTla gene was excised from the X ZAP II phage 
vector using the in vivo excision protocol described by Stratagene. 
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DNA sequence analysis 

Unless otherwise noted, DNA sequencing was performed with double-stranded plasmid DNA 
5 templates using fluorescent dye terminator technology (dRhodamine Terminator Cycle Sequencing 
Ready Reaction kit) on an ABI 310 DNA sequencer (Perkin-Elmer Applied Biosystems). For 
analysis of PCR products, following electrophoretic separation of amplification reaction products, 
the bands of interest were excised from the polyacrylamide gels, the DNA extracted using the 
Quiagen Gel Extraction Kit, and the recovered DNA used as sequencing template. Sequencing was 

10 performed using AmpliTaq DNA polymerase and fluorescent dye terminator technology (as 

described above) and primers 1 and 2 (Table 1) specific for Exon 1. Nucleotide and amino acid 
sequences were analyzed and aligned using either the Clustal method and Lasergene software 
(DNAStar Inc., Madison, WI) or the PILEUP and ALSCRIPT (Genetics Computer Group, Madison, 
WI) sequence analysis package (Version 9.0). Transcription factor binding site homologies were 

15 identified in promoter DNA sequences by searching the transcription factor database using the GCG 
program. 

RNA gel blot analysis 

20 For RNA analysis, roots and other tissues were harvested from mature wild-type (HP; NiclNic2) and 
low alkaloid mutant (LP; niclnicl) Burley 21 tobacco plants. For topping experiments, the stem was 
cut and the top one-third of the plant was removed just prior to flower opening. Roots were 
harvested just prior to topping (0 hr control) and at various times after decapitation. The tissue was 
immediately frozen in liquid nitrogen and stored at -80 °C until RNA extraction and isolation. 

25 Total RNA was isolated from vegetative organs and floral structures of HP and LP Burley 21 

tobacco using the TRI-reagent (Molecular Research Center Inc., Cincinnati, OH) and quantified 
spectrophotometrically by measuring ,4260. Total RNA (5 ng) was electrophoresed through 1 .2% 
agarose gels (containing 0.4 M formaldehyde) and transferred to Nytran+ nylon membranes. 
Following prehybridization the membranes were hybridized with a single-stranded NtPMTla cDNA 

30 antisense probe (corresponding to the antisense strand of Exons 2 to 8 of the NtPMTla cDNA coding 
region) as described above. As a control to quantify and normalize RNA levels in each lane, the blot 
was hybridized with a 400-bp probe derived from the P-ATPase cDNA using primers 6 and 7 (Table 
1) as described below. 
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Semi-quantitative RT-PCR analysis of individual PMT transcript levels 

Total RNA (1 jig) extracted from the roots of HP and LP Burley 21 tobacco plants was reverse- 
transcribed into first-strand cDNA at 42 °C using Superscript II reverse transcriptase (Gibco BRL) 
5 according to the manufacturer's protocol. Two gene-specific primers were employed in the reactions: 
primer 5 capable of recognizing Exon 3 of the PMT genes and primer 8 specific for Exon 8 of the 
nuclear gene encoding the P-subunit of mitochondrial ATPase from N. plumbaginifolia (NpATP2.J) 
and N. sylvestris (NsATP2.1) (Boutry and Chua, 1985; Lalanne et al., 1998). The P-ATPase 
transcript served as an internal reference (constitutively-expressed control) to determine loading 

10 accuracy and to normalize expression levels (Kinoshita et al., 1992) Following first strand cDNA 
synthesis, two sets of nested primers (0.4 fiM each primer) were used to amplify the PMT and P- 
ATPase transcripts: primers 1 and 2 (Table 1) recognized Exon 1 in all five PMT transcripts and 
gave products ranging in size from 220 bp to 420 bp and primers 6 and 7 amplified an approximately 
400-bp region encompassing a portion of Exons 6 to 8 of the P-ATPase coding region. 

15 Amplification was carried out for 25 cycles using the following reaction conditions: denaturation at 
95 °C for 1 min, primer annealing at 60°C for 35 sec, and extension at 72°C for 1.5 min; a final 
extension was conducted at 72 °C for 6 min. Amplification products were radioactively labeled by 
spiking the PCR reaction with 10 \iCi 32P-dCTP. Aliquots of the PCR reaction were analyzed on a 
6.5% non-denaturing polyacrylamide/lX TBE gel and electrophoresed at 600 volts. The reaction 

20 conditions were optimized to provide amplification of both PMT and $-ATPase transcripts in the 
linear range of the reaction by varying the levels of first strand cDNA template, annealing 
temperature, and number of cycles of amplification as described in Kinoshita et al. (1992). 
Molecular weight standards were prepared by PCR amplification using the same primers and 
protocol described above and plasmid DNA templates containing the PMT encoding genomic 

25 fragments, as well as genomic DNA from the various Nicotiana species indicated in the text. 

Following electrophoresis, the polyacrylamide gels were fixed in 5% MeOH, 7.5% acetic acid 
for 30 min, dried overnight, and used to expose X-ray film. PMT band intensities were quantified 
using phosphorimager analysis (Molecular Dynamics) and normalized relative to the intensities of 
the p-ATPase control band in each lane. The experiment was conducted twice with different total 

30 RNA samples, and representative results are presented from one of the two experiments. 

Results 

PMT gene structure and organization in N. tabacum 
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Gel blot analysis of total genomic DNA isolated from N. tabacum cv. Xanthi, hybridized with a 
radioactively-labeled cDNA {NtPMTla) encoding the complete coding region of putrescine N- 
methyltransferase (PMT) showed the presence of five major hybridizing bands in Kpnl or EcoRI 
digested DNA, consistent with the presence of a small multigene family in the N tabacum genome 
5 (Figure 1). 

As part of our initial characterization of the gene family encoding PMT in N tabacum, an 
EMBL3 genomic library, prepared from N tabacum cv. Xanthi DNA, was screened using the 
NtPMTla (A41 1 homologous) cDNA as probe. From a total of 18 recombinant phage isolated, three 
phage were recovered that contained genomic fragments encoding the NtPMT2, NtPMT3 and 

10 NtPMT4 genes. The three PMT genes were completely encoded within a unique sized EcoRI 

fragment within the phage DNA insert which allowed for the correlation of each with a hybridizing 
restriction fragment on the gel blot of N. tabacum genomic DNA (Figure 1). The complete coding 
region and immediate 5' and 3' non-coding sequences of the three genes were determined and found 
to encode full-length PMT proteins (Figure 2). Each PMT gene consisted of 8 exons and 7 introns, 

15 consistent with the gene structure reported previously for the PMT genes from N sylvestris 

(Hashimoto et aL, 1998a). Comparison of the deduced amino acid sequences (Figure 2) revealed 
that the encoded PMT proteins were extremely similar over their entire length, with the only 
significant variability in primary sequence localized to the extreme N-terminal region of the protein. 
This region, completely encoded within Exon 1, contains a variable number of an 1 1 amino acid 

20 repeat with a consensus sequence of NGHQNGTSEHQ. The function of the repeated sequence is 

unknown, but is apparently inconsequential to enzyme function, since the number of repeats does not 
influence activity and PMTs characterized from other species do not contain the repeated element 
(Hashimoto et aL, 1998a; Suzuki et aL, 1999a). 

Multiple rounds of screening of the EMBL3 genomic library failed to yield additional 

25 hybridizing phage containing sequences encoding the other two PMT genes thought to be present in 
the N. tabacum genome and, therefore, a directed cloning approach was pursued using a subgenomic 
library constructed from EcoRI fragments isolated from N. tabacum cv. Xanthi. From this 
hybridization screening, a phage containing the approximately 3.1 kb EcoRI fragment encoding 
NtPMTla was recovered. The coding region of the NtPMTla gene was found to be identical to the 

30 A41 1 cDNA (Hibi et aL, 1994), with the exception of a single base change in Exon 6 that results in a 
conservative amino acid substitution. This difference could be the result of minor differences among 
cultivars used in the two studies (i.e., Xanthi vs. Burley 21). Translation of the open reading frame 
contained in NtPMTla showed that it encoded a protein containing four N-terminal 1 1 amino acid 
repeats, similar to Exon 1 of the PMT gene present in N. tomentosiformis (Hashimoto et aL, 1998a). 
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Given the observation that NtPMTla encoded a homolog of the PMT gene present in N. 
tomentosiformis, the nature and possible evolutionary origin of the remaining PMT gene present in 
the N tabacum genome was brought into question. From our expression studies (described in detail 
below), we had determined that five distinct PMT encoding transcripts were present in the roots of N. 
5 tabacum, four of which could be accounted for based upon the length of the Exon I coding region in 
the four PMT genes isolated and characterized in our studies described above. The fifth transcript 
was similar in size to that encoded by NtPMTla and, therefore, was designated NtPMTlb. Since the 
variability in PMT gene structure is primarily localized within Exon 1, we used a PCR-based strategy 
to analyze the PMT gene structure and family size in N otophora, the other possible progenitor of N 

10 tabacum. As shown in Figure 3, five distinct PCR products were detected in the electrophoretic 
pattern of amplification products generated from N. tabacum genomic DNA using Exon 1 specific 
primers (Table 1). Consistent with our studies described above and the previous work of Hashimoto 
et al. (1998a), three PCR products were detected in the electrophoretic pattern of amplification 
products generated from N sylvestris genomic DNA, and a single band was recovered from N. 

1 5 tomentosiformis genomic DNA. Amplification of genomic DNA from N otophora using Exon 1 

specific primers also yielded only a single band, whose electrophoretic mobility was most similar to 
that of the NtPMTlb derived product. 

Analysis of PMT gene intron and flanking sequences 

20 

The location of the seven introns within the protein coding region of the five PMT genes in N 
tabacum is identical and appears to be conserved among PMT genes from different Nicotiana 
species. There is also little variation in the nucleotide sequences that comprise the Exon-Intron 
splice junctions in the various PMT genes in N tabacum (Table 2). The high degree of nucleotide 

25 sequence similarity recognized among PMT genes within their coding regions is also present within 
their introns and immediate 5' and 3' flanking sequences (Table 2 and Figure 4). In general, a greater 
level of sequence identity is found in the introns of the NtPMT2, NtPMT3, and NtPMT4 genes, than 
in pair- wise comparisons among the introns of the other members of the N tabacum PMT gene 
family. The observed conservation in the intron sequences of the NtPMT2, NtPMT3, and NtPMT4 

30 genes is consistent with their origin from the same progenitor species (N sylvestris). One interesting 
exception occurs within Intron 6, where the length of the intron and the sequence similarity is more 
conserved between NtPMTla and NtPMT4, than between NtPMT4 and NtPMT2 or NtPMT3. 

Approximately 1 kb of nucleotide sequence was determined 5' to the coding regions of the 
NtPMTla, NtPMT2, NtPMT3, and NtPMT4 genes (Figure 4). By comparison to the S'-untranslated 
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region (UTR) present in the A41 1 cDNA, we set the start site for transcription initiation at 
approximately 57 nucleotides upstream of the MET start codon in NtPMTla and NtPMT3, and either 
69 or 60 nucleotides upstream in NtPMT2 and NtPMT4. The major distinguishing feature between 
the 5-UTRs in the various genes is the presence or absence of a 17 bp sequence in the gene. An 
5 appropriately placed TATA-box can be easily recognized 45 bp 5' to the initiation site in all four 

genes. Within the first 200-250 bp upstream of the TATA box, a high level of sequence conservation 
is found to exist among the promoter regions in the four genes. After this point, a clear difference 
can be observed between the NtPMTla promoter and the remaining three genes, and by 400 bp 
upstream, little similarity can be found among any of the gene family members. 

10 Analyzing the proximal regions of the various PMT promoters with various motif scanning 

software identified several G-box-like sequences (Foster et aL, 1994; Kim et aL, 1992; Menkens et 
aL, 1995; Staiger et aL, 1989; Williams et aL, 1992) at various positions among the PMT promoters, 
and a potential metal response element (MRE) (positions -75 to -66; numbering relative to the 
NtPMTla promoter sequence) in three of the four PMTs (Cizewski-Culotta and Hamer, 1989; Thiele, 

15 1992). An unusual 17 nucleotide stretch of guanine occurs at positions -259 to -243 in the NtPMTla 
gene promoter followed upstream by a purine-rich region (positions -332 to -263). In the NtPMT3 
promoter a 14 bp palindromic sequence (positions -497 to -484) was detected. PMT gene expression 
has been reported to increase in root tissues following treatment with methyl jasmonate (Imanishi et 
aL, 1998). However, none of the sequence motifs reported to confer methyl jasmonate- 

20 responsiveness in other plant genes (Mason et aL, 1993; Rouster et aL, 1997) were detected in the 
PMT promoters. 

Comparison of the available nucleotide sequence information from the 3 '-flanking regions of 
the various PMT genes in N tabacum revealed that the 3'-UTRs in the NtPMT2, NtPMT3, and 
NtPMT4 genes of N tabacum share approximately 81-94% identity with each other and are 

25 essentially identical to those reported for N. sylvestris PMTs by Hashimoto et aL (1998a). The major 
distinguishing feature among the various genes is the presence of two short (20 bp and 4 bp) 
deletions in the NtPMT2 gene, which lowers the percent identity. The 3'-UTR of NtPMTla is 
identical to that reported for the A41 1 cDNA (Hibi et aL, 1994) and 81-94% identical to the other 
PMT genes in the N. tabacum genome. Unfortunately, no sequence information is currently 

30 available for the 3'-UTR of the N otophora or N tomentosiformis PMT genes. 

Regulation of PMT gene expression 



To determine whether the members of the PMT gene family in N tabacum were differentially 
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expressed, a series of experiments were carried out to define the temporal and spatial distribution of 
transcripts arising from the five genes. Shown in Figure 5 A are the results of gel blot analysis of 
total RNA extracted from various tissues of mature Burley 21 tobacco plants hybridized with 
radioactively-labeled probe capable of detecting all five PMT transcripts. Consistent with previous 
5 studies (Hashimoto et al. 9 1998b; Hibi et al, 1994), PMT expression is localized exclusively to roots. 
When maturing wild-type (HP) Burley 21 plants are topped (i.e., the floral meristem and upper 1/3 of 
the stem are removed), a dramatic increase in PMT transcript abundance is observed within 2 hr, 
reaching a maximal level of accumulation by 12-24 hr. Two size transcripts are detected on the gel 
blots, reflecting the small difference in message size that occurs as a result of the difference in size of 

10 Exon 1 among the genes. 

In addition to examining PMT gene expression in wild-type plants, we also examined 
expression in a low nicotine-producing (LP) mutant of Burley 21 (Legg and Collins, 1971). The low 
nicotine Burley 21 line harbors mutations at two independent loci (nicl and nic2) thought to be 
global regulators of gene expression involved in alkaloid formation. As shown in Figure 6B, topping 

15 of the low nicotine mutant {niclnicl) Burley 21 did not cause an increase in PMT transcript 

abundance as observed in wild type plants. Thus, it appears that Nicl and Nicl are likely involved in 
regulation of PMT expression in the very least, and may also be involved in the regulation of other 
genes in the alkaloid biosynthetic pathway. Whether this is a direct effect (e.g., transcriptional 
activation) or indirect remains to be determined. 

20 In order to determine the extent to which the individual members of the gene family 

contributed to the general pattern of expression described above, a semi-quantitative RT-PCR 
strategy (Kinoshita et al. 7 1992) was used to detect and quantify the levels of the individual PMT 
transcripts in the roots of both wild-type (HP) and low alkaloid (LP) Burley 21 tobacco. This 
approach takes advantage of the fact that Exon 1 is variable in length among the various PMT genes 

25 (Figure 2), allowing for their individual detection and quantitation following polyacrylamide gel 
electrophoresis and autoradiography. 

Five RT-PCR products (representing Exon 1 from each of the five genes present in N. tabacum) 
were detected in the electrophoretic profiles of amplification products derived from reactions using 
either HP or LP Burley 21 root RNA (Figure 6 A). All five PMT genes present in the N. tabacum 

30 genome were expressed in the roots of wild-type plants, and topping resulted in a differential 

accumulation of transcripts derived from each gene. Among the five genes, transcripts derived from 
the NtPMTl and NtPMTlb showed the greatest increase in abundance rising approximately 3-fold 
during the first 24 hr post-topping, whereas levels of the NtPMTl a and NtPMT4 transcripts changed 
little in response to topping (Figure 6B). In the LP mutant, little or no effect was observed on the 
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levels of the various PMT transcripts following topping, although in some cases (e.g., NtPMTld) a 
small but likely insignificant depression in transcript abundance was detected. Thus, it appears that 
all five genes contribute to PMT activity levels within the root. 

5 II. ADC 

The present invention features the characterization of two members of the nuclear gene family 
encoding ADC in tobacco N. tabacum L. As the following discussion shows, ADC2 is preferentially 
expressed in roots and accounts for the major portion of ADC transcripts present. Furthermore, 
analysis of ADC transcript levels in roots of low and high nicotine producing lines showed that ADC 
10 expression is under the control of the Nicl Nic2 regulatory loci. 

Materials and methods 

Plant growth and tissue preparation 

15 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nicl nic2 mutant N. tabacum cv. Burley 
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used for 
DNA isolation were grown in soilrvermiculite mixture in the greenhouse under natural lighting 
conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co., 

20 Middleton, TN) or hydoponically in a dilute (half-strength) Peters nutrient solution with continuous 
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were 
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the 
plant) from tobacco plants just prior to blooming. Plant tissues were collected from fully matured 
individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA preparation (see 

25 below). 

Screening of genomic libraries and phage characterization 

A genomic library constructed in X EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clonetech, Inc., 
30 Palo Alto, CA) was screened by plaque hybridization (Sambrook et aL, 1989) using an [a- 32 P]- 

dCTP-labeled, 2.7 kb EcoKL-Xhol fragment from plasmid PR24 as probe. PR24 encodes a full length 
ADC cDNA isolated from the roots of wild-type N. tabacum cv. Burley 21 (Wang, 1999). 
Hybridization was performed at 65 °C for 16 h in a solution containing 0.25 M Na2HP0 4 (pH 7.2) 
and 7% (w/v) SDS. Following hybridization, the membranes were washed twice in 2 x SSC, 0.1% 
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SDS for 15 min at room temperature, once in 0.2 x SSC, 0.1% SDS for 30 min at 65°C. Hybridizing 
phage were picked and plaque purified through three subsequent rounds of hybridization screening. 
Phage DNA was isolated from plaque purified phage using a Qiagen Phage Midi Preparation Kit 
(Qiagen, Germany) and insert DNA characterized by restriction mapping and DNA gel blot analysis. 
5 The relevant hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further 
analysis. 

Nucleic acid sequencing and analysis 

10 Nucleotide sequencing was carried out manually using the Sequenase Version 2.0 protocols 

according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI 
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid 
DNA templates prepared utilizing the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The 
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST 

15 sequence analysis programs (Altschul et al. 9 1990; Gish and States, 1993) and protein sequence 
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence 
Analysis package, Version 9.0 (GCG, University of Wisconsin, Madison, WI) and the various gene 
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD) 
nucleotide and protein sequence database. Manual adjustment of the sequence alignments were 

20 carried out as necessary. 

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent 
25 (Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For 
RNA gel blot analysis, aliquots (10 ng) of total RNA extracted from the various tissues were 
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran 
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was 
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the 
30 membranes were prehybridized in 7% SDS, 0.25 M Na^PO^ pH 7.2 for 2-4 hours at 65 °C. 

Hybridization was carried out in the same buffer in the presence of 32 P-labeled probes for 16 hr at 
65 °C. The membranes were washed under high stringency conditions and subject to 
autoradiography at -80°C for approximately 48 h. 

For gel blot analysis, [a- 32 P]-dCTP -labeled probes were prepared by random primed labeling 
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(Random Primed Labeling Kit, Boehringer Mannheim, Indianapolis, IN) using 25-50 ng of a 2.7 kb 
EcoRl-Xhol fragment derived from PR24 and a 460 bp fragment amplified from the P- subunit of the 
tobacco mitochondrial ATP synthase gene (atp2) (Boutry and Chua, 1985). 

5 Semi-quantitative RT-PCR analysis of NtADCl and NtADC2 transcript levels. 

Total RNA (2 p,g) from roots, leaves, or floral parts was reverse transcribe at 40°C for 1 h in a 
reaction cocktail containing 200 units of Superscriptll reverse transcriptase (RNase H-, Gibco BRL, 
USA), 10 units RNase inhibitor (Perkin Elmer), 200 \im dNTPs and 40 pmol of primer, in total 

10 volume of 20^,1. For first strand cDNA synthesis, a single primer [5'- 

AGAAAAACATC ACCAACT-3 '] capable of hybridizing to both the ADC1 andADC2 transcripts 
was used in the reaction. As a control, a primer ( 5 '-GC AACTGTC ATCTTATCATCTTC-3 ') 
specific for the P-subunit of the tobacco mitochondrial ATP synthase gene apt! (Boutry and Chua, 
1985) was used in the reverse transcriptase reaction. 

15 Following reverse transcription, the single stranded cDNA products were serially diluted over a 

concentration range between 1 to 50 ng RNA, and PCR amplification was carried out for 25 cycles 
of 45 s at 94 °C, 1 min at 64 °C and 1 min at 72° C in a Genemate thermocycler (ISC Bioexpress, 
UT). The reaction mixture contained cDNA template, 1 x PCR buffer (Boehringer Mannheim), 100 
|iM dNTPs, 25 pmol of each forward and reverse primer and 1 unit Taq DNA polymerase. The PCR 

20 reactions specific for ADC1 transcripts contained the following primers: ADC 1 -forward, 5'- 
CGTAGACGCTACTGTTTC-3 ' and ADCl-reverse, 5'-TGGACAAC TGTGGAGGCG-3 * . 
Reactions specific for ADC2 transcripts contained primers ADC2-forward, 5'- 
TGTAGATGCTGCTGTTGTTT-3 and ADC2-reverse, 5'-TGAACAAC TGCGGAGGC A-3 ' . 
Control reactions for normalization of amplification products contained 25 pmol of primers specific 

25 for the tobacco apt2 transcripts: atp2 forward, 5 '-GTATATGGTCAAATGAATGAGCC-3 1 , and atp2 
reverse.int, 5 '-GCAGTATTGTAGTGATCCTCTCC-3\ For quantitation purposes, amplification 
reactions were supplemented with l^xCi 32 P-dCTP. PCR products were separated by electrophoresis 
through 1.2% agarose gels, the fractionated reaction products transferred onto a Hybond N+ 
membranes, dried and subject to autoradiography at -70° C. Quantitation was carried out by 

30 phosphorimaging using a Molecular Dynamics Phosphorlmager. Values were normalized relative to 
the intensities of the atp2 control band in each lane. The experiment was conducted twice with 
different total RNA samples, and representative results are presented from one of the two 
experiments. 
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Results 

These studies show the structure and expression of individual members of the ADC gene 
family in tobacco. An a- 32 P-dCTP-labeled 2.7 kb EcoRI-XhoI fragment from PR24 encoding the 

5 ADC coding region was used to screen an X EMBL3 phage genomic library. From a screen of 

approximately 3 X10 5 phage, seventeen hybridizing phage were recovered, of which five were fully 
characterized by restriction mapping and DNA gel blot analysis. These phage fell into two groups 
based on their restriction profile. The relevant hybridizing fragments from the various phage were 
cloned into pBluescript and their nucleotide sequence determined. 

10 Presented in Figure 7 are the nucleotide and predicted amino acid sequences of NtADC-1 and 

NtADC-2 genes. Both genes contain a single open reading frame, uninterrupted by introns. The 
nucleotide and amino acid sequence encoded in NtADC-1 is identical to that of PR24, the full length 
cDNA isolated from N. tabacum cv Burley 21. There are 84 nucleotide differences within the 
NtADC-1 and NtADC-2 coding regions, resulting in 23 amino acid differences between the ADC1 

15 and ADC2 proteins, respectively. The ADC1 protein is one amino acid shorter in length, missing 
Val-13. 

By comparison to the full-length cDNA, the 5 '-untranslated region (UTR) present in 
NtADC-1 and NtADC-2 are 431 bp and 432 bp long, respectively. The size of the 5'-UTR in the 
ADC transcripts is considerably larger than the average size of the plant leader sequence (Joshi, 
20 1987). In contrast, the 3'-UTRs present in NtADC-1 and NtADC-2 are relatively short, 

approximately 84 nucleotides in length. In both gene sequences, a conserved polyadenylation signal 
(AATAATA) can be recognized 23 nucleotides from the site of polyadenylation site found in the 
PR24 cDNA. 

Pairwise comparison of the N. tabacum ADC1 and ADC2 proteins with the ADCs of other 
25 plant species showed that the N. tabacum proteins are approximately 82% identical to the ADC of 
its evolutionary progenitor species N. sylvestris [Genbank Accession No. ABO 12873] and 86% 
identical to the ADC from tomato (Lycopersicon esculentum) [31], another member of the 
Solanaceae family (Figure 2). As might be expected, the N. tabacum ADC shares considerably less 
similarity to ADCs isolated from species more distantly related evolutionarily, such as Arabidopsis - 
30 67% identical [32, 33], soybean- 67% identical [34], and oat - 42% identical [35] and is only 29% 
identical to the enzyme from Escherichia coli - [36]. 

The predicted protein coding regions for the N. tabacum ADCs are substantially longer than 
those reported for the ADC proteins of N. sylvestris and L. esculentum [31], but are similar in length 
to those reported in Arabidopsis, oat, soybean [32-35] and for the E. coli enzyme [36]. The 
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difference in overall length appears to arise from an apparent nucleotide deletion in the N. sylvestris 
and tomato cDNA sequences relative to the ADC1 and ADC2 predicted sequence and those in other 
plants. In the nucleotide sequences reported for both the N. sylvestris and tomato cDNAs, a guanine 
residue (position 2295 in the N. sylvestris sequence and 1531 in the tomato sequence) is missing 

5 [Genbank Accession No. AB012873]. This deletion changes the reading frame and introduces a 
premature termination to the predicted coding region. Using the sequence information available in 
the NCBI database, correcting for this error allowed us to extend the predicted C-terminus of the 
both ADC proteins, yielding the alignment to the N. tabacum ADCs and those of other plant ADCs 
as indicated in Figure 8. We have also included in the alignment shown in Figure 8, the correction at 

10 the N-terminus of the predicted tomato ADC protein sequence noted by Perez- Amado et al. [37], 
allowing better alignment of all of the higher plant sequences. 

Developmental regulation of arginine decarboxylase expression 

15 It has been shown that nicotine formation can be activated in the roots of maturing tobacco 

plants by topping, that is, removal of the flower head and several young leaves (Akehurst, 1981; 
Hibi, et al., 1994). Coincident with the activation of nicotine formation, there is an increase in the 
levels of transcripts encoding ODC, PMT and spermidine synthase (SPS) over the subsequent 24 hr 
period in wild-type plants (Hibi et al., 1994; Riechers and Timko, 1999). To determine the effects of 

20 topping on ADC expression in roots, Burley 21 plants were grown in the greenhouse to the bud stage 
at which point the upper 1/3 of the plant was removed and samples of roots tissues were collected 
before and at various times post-topping. As shown in Figure 9, ADC message abundance increased 
in the roots of topped Burley 21 plants during the 24 hr period after topping. Low alkaloid (LA) 
mutants of Burley 21 show a much lower level of ADC expression in their roots, and no induction of 

25 ADC transcript accumulation after topping. The lack of ADC induction in the low-alkaloid mutant is 
consistent with previous studies (Hibi et al, 1994; Riechers and Timko, 1999; Wang, 1999) showing 
a general inability to activate gene expression leading to increased polyamine formation and alkaloid 
biosynthesis as a result of the mutation of the Nicl and Nic2 regulatory genes. 

30 NtADC-2 is predominately expressed in roots of wild-type plants. 

Due to the high degree of identity between the NtADC-1 and NtADC-2 transcripts (e.g., 95.8% 
coding regions, 94.4% and 96.4% in 5'- and 3'-UTRs, respectively), it is impossible to distinguish 
between the two transcripts by RNA gel bot analysis. Therefore, we employed a RT-PCR based 
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strategy and gene specific oligonucleotide primers. Total RNA was extracted from tobacco roots, 
leaves and flowers, and single-stranded cDNA synthesized using an oligonucleotide primer capable 
of hybridizing to both ADC1 and ADC2 transcripts. As an internal control for amplification, a gene 
specific primer recognizing the atp2 transcript encoding the P-subunit of the tobacco mitochondrial 
ATPase was include in the reactions. Under experimental conditions providing amplification in the 
linear range of the PCR reaction, gene specific forward and reverse primers were used to specifically 
amplify either ADC1 or ADC2 cDNAs. Test reactions (Figure 10A) using plasmid DNA encoding 
NtADCl or NtADC2 as template demonstrated the specificity of the primers. As shown in Figure 
10B, the main transcripts detectable in all tissues tested are derived from NtADC-2. Flowers express 
the highest level of ADC, and leaves lowest. In the flowers, although ADC1 is detectable, far less 
than ADC2 Roots also express a significant level of ADC. 

ADC transcript levels are highest in the roots and floral organs, and low in other plant tissues. 
The two ADC genes investigated appear to have different modes of regulation, with ADC2 being 
predominately expressed in the roots and other organs. 

At the present time, only limited information is available on the nature of regulatory regions 
in the promoters of genes encoding enzymes of alkaloid biosynthesis. The availability of cloned 
genomic fragments encoding ADC allows one to begin mapping regulatory sequences within 
members of these genes responsible for tissue specific, developmental, and inducible expression. 



III. ODC 

The present invention features the genes of two members of the nuclear gene family encoding 
ODC in tobacco N. tabacum. As the following experimental discussion shows, the ODC-2 gene is 
preferentially expressed in roots and floral tissues. Furthermore, the abundance of ODC transcripts in 
root tissues is affected by topping. Furthermore, analysis of ODC transcript levels in roots of low 
and high nicotine producing lines shows that ODC expression is under the control of the Nicl Nic2 
regulatory loci. 
Materials and methods 

Plant growth and tissue preparation 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nicl nic2 mutant N. tabacum cv. Burley 
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used for 
DNA isolation were grown in soikvermiculite mixture in the greenhouse under natural lighting 
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conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co., 
Middleton, TN) or hydroponically in a dilute (half-strength) Peters nutrient solution with continuous 
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were 
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the 
5 plant) from tobacco plants just prior to blooming. Floral parts and other plant tissues were collected 
from fully matured individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA 
preparation (see below). 

Screening of genomic libraries and phage characterization 

10 A genomic library constructed in EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clonetech, Inc., 
Palo Alto, CA) was screened by plaque hybridization (Sambrook et al, 1989) using a 32 P- 
radiolabeled, 1.6 kb EcoKL-XhoI insert from plasmid PR46 as probe. PR46 encodes a full length 
ODC cDNA previously isolated by differential screening of plasmid libraries prepared from mRNA 
isolated from the roots of wild-type Burley 21 plants before and 3-days post-topping (Wang, J., 

15 Sheehan, M., Bookman, H. and Timko, M.P., unpublished data). Hybridization was performed at 
65 °C for 16 h in a solution containing 0.25 M Na^PC^ (pH 7.2) and 7% (w/v) SDS. Following 
hybridization, the membranes were washed twice in 2 x SSC, 0.1% SDS for 15 min at room 
temperature, once in 0.2 x SSC, 0.1% SDS for 30 min at 65 °C. Hybridizing phage were picked and 
plaque purified through three subsequent rounds of hybridization screening. Phage DNA was isolated 

20 from plaque purified phage using a Qiagen Phage Midi Preparation Kit (Qiagen USA, Valencia, CA) 
and insert DNA characterized by restriction mapping and DNA gel blot analysis. The relevant 
hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further analysis. 

Nucleic acid sequencing and analysis 

25 Nucleotide sequencing was carried out manually using the Sequenase Version 2.0 protocols 

according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI 
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid 
DNA templates prepared utilizing the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The 
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST 

30 sequence analysis programs (Altschul et al. y 1990; Gish and States, 1993) and protein sequence 
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence 
Analysis package, Version 9.0 (GCG, University of Wisconsin, Madison, WI) and the various gene 
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD) 
nucleotide and protein sequence database. Manual adjustment of the sequence alignments were 
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carried out as necessary. 

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent 
5 (Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For 
RNA gel blot analysis, aliquots (10 jig) of total RNA extracted from the various tissues were 
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran 
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was 
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the 

10 membranes were prehybridized in 7% SDS, 0.25 M Na 2 HP0 4 , pH 7.2 for 2-4 hours at 65 °C. 

Hybridization was carried out in the same buffer in the presence of 32 P -labeled probes for 1 6 hr at 
65°C. The membranes were washed under high stringency conditions and subject to 
autoradiography at - 80° C for approximately 48 h. 

Restriction fragments derived from cDNA clones of interest were separated by agarose gel 

15 electrophoresis, the DNA was purified, and quantified by spectrophotometry. [ 32 P]-dCTP -labeled 
probes were prepared from 25-50 ng of insert DNA by random primed labeling (Random Primed 
Labeling Kit, Boehringer Mannheim, Indianapolis, IN). As a control, the blots were also probed with 
radioactively labeled probes encoding the alkaloid biosynthesis enzyme putrescine N- 
methyltransferase (PMT) (Riechers and Timko, 1999), a root specific, topping inducible p- 

20 glucosidase encoding cDNA (TBG-1) (Riechers, D.E. and Timko, M.P., unpublished data), 26S 
rRNA (PR31) or 28 S rRNA fragments. 

Genomic DNA isolation and gel blot analysis 

Tobacco genomic DNA was prepared from tobacco leaf tissue by the method of Junghans and 

25 Metzlaff (1990). Total genomic DNA (15 ^ig) was digested to completion with EcoRl or Hindlll, the 
digestion products were fractionated by electrophoresis through a 0.8% (w/v) agarose gel, and 
transferred onto Nytran nylon membrane (Schleicher & Schuell, Keene, NH) in the presence of 0.4 N 
NaOH (Sambrook et al. 9 1989). Following transfer, the membrane was rinsed in 2 X SSC , the DNA 
was UV cross-linked to the membrane, and the membrane was prehybridized and hybridized as 

30 described above. Following hybridization and washing, the membranes were subjected to 
autoradiography at -80 °C. 

Results and discussion 
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Gel blot analysis of tobacco genomic DNA cut with various restriction enzymes and 
hybridized with an [a- 32 P]-dCTP-labeled 1.6 kb EcoW-XhoI cDNA fragment (PR46) encoding the 
full-length ODC protein from N. tabacum cv Burley 21 (Wang, J., Sheehan, M., Bookman, H. and 
Timko, M.P., unpublished data) indicated ODC is encoded by small gene family in the N. tabacum 
5 genome (Fig. 11). Four to five major bands and several minor bands of sufficient size to encode full- 
length genes are detected in either EcdRl or HindlU digested tobacco DNA. 

To further analyze the structure and regulation of members of the ODC gene family in 
tobacco, a X EMBL3 phage genomic library constructed with DNA from TV. tabacum cv Xanthi was 
screened using a [a- 32 P]-labeled probes prepared from PR46 (as described above). From a screen of 
10 approximately 3 X10 5 phage, five hybridizing phage were recovered, of which three were fully 
characterized by restriction mapping and DNA gel blot analysis. Two phage proved to contain 
identical insert DNA and the third had a unique restriction digestion profile. Following DNA gel blot 
analysis, the hybridizing fragments were cloned into pBluescript and their nucleotide sequence 
determined. 

15 The complete NtODC-2 gene spans two Sail fragments of 2.7 kb and 6.5 kb. The coding 

region of the gene contains a single 13 02 bp open reading frame uninterupted by introns (Fig. 12). 
The nucleotide sequences of NtoDC-2 is identical within the coding and 5* and 3'- untranslated 
regions to the PR46 encoded cDNA, with the exception of four nucleotide changes (residues +2, +4, 
+6 and +8) in the S'-untranslated region. These nucleotide differences likely reflect changes 

20 introduced during the cDNA synthesis reaction. 

The predicted amino acid sequence for the NtODC-2 encoded protein (designated pODC2) 
(Fig. 13) is identical to the ODC characterized from Burley 21 tobacco encoded by PR46 (Wang, J., 
Sheehan, M., Bookman, H. and Timko, M.P., unpublished data) and to the partial N. tabacum ODC 
cDNA sequence (PR17) reported by Malik et aL, (1996). Comparison of the predicted amino acid 

25 sequence for pODC2 with the ODC proteins characterized from two different tobacco cultivars 

showed that the pODC2 differs by 7 amino acid (98% identity) from the ODC protein characterized 
from the high alkaloid cultivar, K tabacum cv. SC58 [Genbank Accession No. Y 10472.1] and by 7 
amino acid (98% identity) from ODC protein from BY-2 cells. The tobacco pODC2 is 89% and 
90% identical to the ODCs from tomato (Lycopersicon esculentum) and jimsonweed {Datura 

30 stramonium), respectively, but substantially less similar to ODCs from yeast (35% identity) and 
humans (32% identity). 

The NtODC-1 gene, contained on an 4.0 kb Xbal fragment, encodes a single open reading 
frame of 141 amino acids encompassing the amino terminal one-half of ODC (Fig. 12). Six amino 
acid residue changes distinguish the NtODC-2 and NtODC-1 encoded proteins over the homologous 



WO 00/67558 



PCT/US00/12450 



29 

region of the proteins. Beginning at amino acid residue 130, the NtODC-1 encoded protein (pODCl) 
diverges from pODC2, with a stop codon present after residue 141. Scanning the available 
nucleotide sequence (> 1 kb) in the 3'-flanking region of the NtODC-1 gene failed to reveal any 
evidence for ODC homologous protein sequences in any of the three translational reading frames. 
5 Interestingly, a comparison of the 5 f -flanking sequence of the NtODC-1 and NtODC-2 genes 

revealed that while the NtODC-2 gene has a clearly recognizable TATA-box properly located at 
approximately -35 bp from the transcriptional start site, no such regulatory motif is found in the 
NtODC-1 gene sequence. Consistent with this observation, RNA gel blot analysis performed using a 
hybridization probe prepared from NtOCD-1 immediately downstream of the frame shift, failed to 

10 detect any message in various tissues of mature tobacco plants (data not shown). Thus, it appears that 
NtODC-2 represents an unexpressed pseudogene in the N. tabacum genome. 

To determine the spatial pattern of expression of the NtODC-2 gene, gel blot analysis was 
carried out using total RNA prepared from roots, stems, young and mature leaves, and various floral 
parts of Burley 21 tobacco plants. As shown in Fig 14, transcripts encoding ODC were easily 

15 detected in the roots, with little or no expression in other tissues except sepals, carpels, and mature 
stamens. 

The formation of nicotine and total leaf alkaloids in tobacco is known to be under the control 
of at least two independent genetic loci (Legg et al, 1969; Legg and Collins, 1971), designated Nicl 
and Nic2 (Hibi et al, 1994). Nicl and Nic2 are semi dominant and operate synergistically to control 

20 plant alkaloid content, with mutations within these genes resulting in plants with reduced levels of 
nicotine and total leaf alkaloids (wild-type > nicl > nic2 > nicl nicl) (Legg et al, 1969; Legg and 
Collins, 1971). Although no information is available on the nature of their encoded products, it has 
been speculated that Nicl and Nic2 likely encode transcriptional regulators capable of globally 
interacting with a subset of genes encoding components of polyamine and alkaloid biosynthesis 

25 (Hibi et al , 1994). Removal of the flower head and several young leaves (i.e., topping) leads to 

activation of nicotine formation in the roots of decapitated plants (Akehurst, 1981; Hibi et al, 1994). 
To determine the effects of topping on NtODC-1 expression in roots, Burley 21 plants were grown in 
the greenhouse to the bud stage at which point the upper 1/3 of the plant was removed and samples 
of roots tissues were collected before and at various times post-topping. As shown in Fig 14B, low 

30 levels of the ODC transcripts were found in roots prior to topping and message abundance increased 
approximately 2-fold in the roots of topped Burley 21 plants 4 hr after topping. By 24 hr after 
topping, ODC transcript levels return to their initial levels. Low alkaloid mutants of Burley 21 
subjected to the same treatment show a much lower level of stimulation of ODC transcript 
accumulation after topping, and the enhanced transcript abundance does not persist beyond 4 hr. By 
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comparison, transcripts encoding PMT and and a tobacco root-specific P-glucosidase (TBG-1) show 
patterns of accumulation similar to that observed for ODC transcripts in wild-type plants, but no 
induction in the low-alkaloid mutant, consistent with previous studies (Hibi et al., 1994; Riechers 
andTimko, 1999; Wang, 1999). 

5 

IV. SAMS 

A single recombinant phage is identified as encoding for SAMS. This X phage contains an 
approximately 15kB Sail insert. Restriction mapping and PCR analysis indicates that the insert 
DNA contains primarily the coding and 3 'non-coding portions of the SAMS gene. The nucleotide 
10 sequences for the gene encoding SAMS can be found at GenBank Accession Nos. AF27243 (full 
length SAMS cDNA). 

V. NADH dehydrogenase 

A fragment of the cDNA encoding for NADH dehydrogenase in N. tabacuum shows high 
15 expression in the roots of mature wild-type HP plants compared to low alkaloid mutant LP plants. 

VI. Phosphoribosylanthranilite isomerase (PAI) 

The gene encoding for a fragment of phosphoribosylanthranilite isomerase in N. tabacuum is 
a homolog of the Arabidopsis thaliana gene encoding PAI, an enzyme involved in tryptophan 
20 biosynthesis. This enzyme is involved in the overall formation of aromatic compounds in plants. 
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What is claimed is: 

I. An isolated DNA molecule comprising the nucleotide sequence of (SEQ. ID. NO. 2), (SEQ. ID. 
NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 1 1), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID. 

5 NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26) or 

comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ ID NO. 3), 
(SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), 
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) OR (SEQ. ID. NO. 24). 

10 2. A vector comprising the isolated DNA molecule of claim 1 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

3. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide 
sequence of claim 1 . 

15 

4. A vector comprising the isolated DNA molecule of claim 3 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

5. A cultured transgenic tobacco cell stably transformed with the vector of claim 2. 

20 

6. A cultured transgenic tobacco cell stably transformed with the vector of claim 4. 

7. A transgenic tobacco plant stably transformed with the vector of claim 2. 

25 8. A transgenic tobacco plant stably transformed with the vector of claim 4. 

9. The isolated DNA molecule of claim 1, wherein the isolated DNA molecule comprises the 
nucleotide sequence of (SEQ ID NO:). 

30 10. A vector comprising the isolated DNA molecule of claim 9 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

I I. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide 
sequence of the isolated DNA molecule of claim 9. 
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12. An isolated DNA sequence comprising about a fifteen to about a twenty-five base pair 
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair 
sequence found in (SEQ. ID. NO. 2), (SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 1 1), 
(SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), 

5 (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26). 

13. A cultured transgenic tobacco cell stably transformed with the vector of claim 10. 

14. A transgenic tobacco plant stably transformed with the vector of claim 10. 

10 

15. A vector comprising a DNA sequence which encodes an antisense mRNA which is 
complementary to a fragment of a mRNA encoded by the isolated DNA molecule of claim 1, 
wherein said sequence is operably linked to sequences capable of directing the transcription of said 
antisense mRNA in tobacco cells and wherein the expression of said antisense mRNA in tobacco 

15 cells is sufficient to provide for reduced nicotine content in tobacco cells which are stably 
transformed with said vector as compared to untransformed tobacco cells. 

16. A cultured transgenic tobacco cell stably transformed with the vector of claim 15. 

20 17. An isolated and purified protein comprising the amino acid sequence identified in (SEQ ID NO. 
3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), 
(SEQ. ID. NO. 19), (SEQ. ED. NO. 22) or (SEQ. ID. NO. 24). 

18. A method for regulating gene expression in a plant comprising functionally linking an alkaloid 
25 gene promoter to a nucleic acid encoding a protein, wherein the promoter comprises a nucleic acid 

sequence selected from the group consisting of the sequences identified in (SEQ ID NO. 1), (SEQ. 
ID. NO. 4), (SEQ ID. NO. 7), (SEQ. ID. NO. 10), (SEQ. ID. NO. 17),and (SEQ. ID. NO. 20). 

19. The method of claim 18, wherein the nucleic acid encoding a protein encodes a protein involved 
30 in the biosynthesis of alkaloids in plants. 

20. A plant transformed by the method of claim 18. 
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laau ^aa laaiaauuiiia a aiiiaifl im]iacjamiiiiiiM13^3ZM3^! 
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y ^ ^ 

,<*> 



4ift bp 



^NtPMT-lb - 



- NtPNltZ 
ZZO Ot> — — ' 
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NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT3 
Nt PMT2 
NtPMT4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



-gc tgtacaaaag gatgtctcaa atcatttgga atattaattc -102 9 
— ctgagttg -1039 





















tgcaatcaac 
acaagaacaa 


aagaaatacc 
ttcctggtga 


ccactattaa 
atcagatgga 


gacccattat 
tgaagataat 


cactggcaca 
agaggtgggt 


aaaattatga 
ggaatctata 


O-box 

gatcattaaa catcttaaac 
accaaapcag ctggjttgagt 


-949 
-959 




















ctgtccctat 
gactgtgcga 


ttggaagagt 
gttgcagaaa 


gtggtatggg 
caattgaagg 


agatgcctcc 
gtcatttgtg 


caggagtacc 
gaatttgggg 


taaagctgaa 
ccatttcaaa 


tatgatggaa 
ggaaaaagaa 


gttttaacaa 
aagatgactt 


-869 
-879 




















acaaattggg 
agcattaata 


aagcagggat 
aatcaaatta 


tgagggattc 
aaataaggct 


tcagagatga 
tagcgttaaa 


agagggaggc 
atcaaaggaa 


tttgtcatgg 
atggcaagcc 


ctttttcgat 
tggctcctgg 


gcctataatc 
agcaatgctt 


-789 
-799 




















tataataaca 
ctgaggacag 


tcagtgaagc 
tagtaaaaac 


agaattgaaa 
aatatcagac 


gccatcaagt 
aaaaagtaaa 


atgggtgtga 
gttgtattat 


atggtgcaaa 
ttagcttgag 


tacaaaggaa 
gataaagtat 


tatcaaactt 
gtcattagtt 


-709 
-719 




















cattgtggaa 
ttgtgagaga 


actgactcga 
tttggtgtcc 


ggatgatcta 
tctacaatga 


tgacatacta 
ttgttgaagt 


cagaccaaaa 
ccctatttat 


grtcjjlaagcaa 
Sgc^atacac 


caacaagttg 
aggaaacaaa 


aaacaagaga 
atcctaggat 


-629 
-639 



g jj^ati^caatgg agaaggaaaa tatttccagt -62 5 

G-Box 

ccgagaaatt aatggagatt ctggac ^cct gcag^ acacc tgttacccat tgccttcgcg aagca||itca agj|ggcagac -54 9 

caagcccctc ttaaatgaca ataatggggt taatgatgaa tatgtagcgg catgacatga atgcc||iaat tc$jccgcaac -559 



gtaaacacaa gtgaatgaag agaagccaaa ataatctcta tcattcaagc cttaggtgga gatta 

PAL 



|aaa atgatttact -54 5 



tggtttgcta aag^ggccac cagagctaac 
gactatttat ttagtattga ggaatatttt 



gaaggtitci 
ttattaia 



gtatcaaaag 
gtttgcttcc 



cggccaaggg 
gttgattacg 



469 
479 



gtcat facaga ttttagacafe 
gtatclggtg acaagcattc 

ttcttatcaa agt^ataggt gatcaacagc tttcgt^ : aaf ggtcattagg agaatattat aatctctttt atgctgaaga -465 

ccctttc^tc atggatatgj| ggcaggtccc ttattttaga attaga^atg aaaaafcctaa 
ttgattt$igg gatctactcg ataccaaccg aagccgttgt ccttga^ctt cgctt^catt 

acccaca||aa ggaagatca§! aaaatacatg actttcagat gacttcj§tgg agcttgattt 



ctgt; 
tccgi 

tctgg-gtcca ca$ 
cagcfcaagag gtg 



tag tgggaggaaa tcgtctaatg tgta^ittl| 
tea ca|g|iga|§gc acccattc^ tta|||iai^ 
"gc acccattc^ tta^^^ap^ 
ca gatatcatgg aat|^j^|cija 

gtg.gtaaggt t|at§ttccc ctglgj|gtaa ttStttttttg 

^^ctita c§q$jjjgEt^ti& aagggisgtttg i^cg|ggagt 

c cgt^tjgjg aa^Ktttg i|ta|ggagt 

a gagsaggags ggaggcagaa gagggaatag 



cccltagfcl cgtlpgtctcc 
gaa^cca^ I|a$£ctata 
gapecaggg paggctgta 
tt&tttgtgig -^aagagggag 



tttttttttg 
taattcatct 



ttaggtaa&a 
caaatiggtia;< 
caaatigtS- 
ategggcaca 



taagttaatt 
tccgtjgtgcc 

gtjggacc 

gctagj|tggt 




tttatatata ||catggtit 
fcaacaiatjjc Hagaaaiga 
Saacagat£c pa$ta, 
att tgggggg gggjggggg g 

gaialiaai iaai 

||^^t|gg gt^cts'i 
g gticti 




NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 




NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



NtPMT3 
NtPMT2 
NtPMT4 



mm'm**$. IP ^cw 

||§|it||ggg§ |||aatcgta atacttgttg jjjjialf 
§|£&tg$&g£ Ipiaatcata atacttgttg 



NtPMTla ggfgggg^f 
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NtADCl 

ttcacgttctcttctcaattcccataaaagaaacccttccgt'ta'g 
gtttccgtcctatttt--ctcttcttctacgcttc 78 
NtADC2 

c 

tc. . .a c 

. .t. . . 80 



319 



NtADCl 

ctcttctgatatcaatatctgtatggtgtttttcttg 
ttcgaattttagatttgttttgcctttaatacctgta 
acctta 158 
NtADC2 

a 

t a. . 

160 

NtADCl 

taattctctgtttaaaccaaaaacttagcttcttctg 
aagtcagggtggggatatttggatcgtgtaagagtgt 
gttaga 238 
NtADC2 - 



239 



NtADCl 

aggtgattatcttttgattcagttccttttttgcttc 
ttttgagggggtagccggggcctcggcctcggcgggt 
tttaat 318 
NtADC2 

g 



NtADCl 

agcccccatctattacaaccattgggcaaaaacatca 
ttaaatctgtacaaagcaaacccttaatttagtttaa 
ttttct 398 
NtADC2 

t 

a 

399 



MPALGCCVDAT 
V S P P 
NtADCl 

gtattctttgattctttaacagaagaagaagagATGC 

CGGCCCTAGGTTGTTGCGTAGACGCTACT 

GTTTCCCCTCC 475 
NtADC2 

a t ATG . 

T T...G. . GTT 

479 

1 

* * * * 

16LGYAFSRDSS 
LPAPEFFTSGVPP 
T N S A 
NtADCl 

CTCGGCTATGCCTTCTCTCGGGATAGCTCTCTTCCCG 
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CGCCGGAGTTCTTTACCTCCGGCGTACCTCCTACAAA 

CTCCG 555 

NtADC2 

. . .A 

• G G 

. . .T. 559 

17 *s******** 

*******A***** 

* * * * 

43 AGSIGSPDLS 
SALYGVDGWGAPY 
F S V 

NtADCl 

CCGCCGGTTCCATTGGGTCTCCGGATCTGTCCTCTGC 
TTTGTACGGGGTCGATGGGTGGGGAGCTCCTTATTTC 
TCCGTT 63 5 
NtADC2 

. . . .T.C T. . . .G 

. . .A 

. .T. . . 639 

44 *a******** 
************* 

* * * 

69 NSNGDISVRP 
HGTDTLPHQE IDL 
L K V V 
NtADCl 

AACTCTAACGGAGATATCTC CGTC CGAC CACATGGT A 
CGGACACACTCCCCCACCAGGAAATTGACCTTCTCAA 
GGTCGT 715 



NtADC2 

T C. . . . 

T T 

719 

70 ********** 
************* 
* * * * 

96KKASDPKNSG 
GLGLQLPLVVRFP 
D V L K 
NtADCl 

GAAAAAGGCCTCCGACCCGAAAAATTCAGGGGGGCTC 
GGGCTTCAGCTGCCTCTTGTTGTTCGCTTCCCTGATG 
TGCTAA 795 
NtADC2 

T T 



. .T . G . 799 

97 ********** 
************* 
* * * * 

123 NRLESLQSAF 
DLAVHSQGYGAHY 
Q G V 
NtADCl 

AAAACCGGTTGGAATCTCTGCAATCGGCTTTTGATCT 
CGCTGTTCATTCCCAGGGCTATGGGGCCCACTACCAA 
GGTGTT 875 
NtADC2 
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. . .G 

879 

124 ********** 
************* 
* * * 

149 YPVKCNQDRF 
VVEDIVKFGSSFR 
F G L E 
NtADCl 

TATCCCGTGAAATGCAATCAAGACAGGTTCGTGGTGG 
AAGATATTGTGAAATTCGGGTCGTCATTCCGGTTCGG 
GTTGGA 955 
NtADC2 



C C 

959 

150 ********** 
**********p** 
* * * * 

176 AGSKPELLLA 
MSCLCRGSAEGLL 
V C N G 
NtADCl 

AGCTGGGTCTAAACCCGAGCTCCTGTTAGCCATGAGC 
TGTCTCTGCAGGGGCAGTGCTGAGGGCCTTCTCGTTT 
GCAATG 1035 
NtADC2 

. . .C 

A 

1039 



177 ********** 

*****K******* 
* * * * 

203 FKDAEYI SLA 
LVARKLMLNTVIV 
L E Q 
NtADCl 

GTTTCAAGGACGCTGAGTACATTTCGCTTGCTTTGGT 
TGCAAGAAAGCTCATGTTAAACACTGTAATTGTTCTT 
GAACAA 1115 
NtADC2 



G. . . 

1119 

204 ********** 
************* 

* * * 

229 EEELDLVIDI 
SRKMAVRPVIGLR 
A K L R 

NtADCl 

GAGGAGGAGCTTGACCTTGTGATTGATATAAGCCGTA 
AGATGGCTGTTCGGCCCGTAATTGGACTTCGGGCTAA 
GCTCAG 1195 
NtADC2 

A. . 

T 

1199 

230 ********** 

*H*********** 

* * * * 



f/tjuscl (c) 
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256 TKHS GHFGST 
SGEKGKFGLTTTQ 
I V R V 
NtADCl 

GACCAAGCATTCAGGCCATTTTGGATCCACTTCTGGA 
GAAAAAGGTAAGTTTGGGCTTACAACGACCCAAATTG 
TTCGTG 1275 
NtADC2 



1279 

257 ********** 
************* 

* * * * 

283 VKKLEESGML 
DCLQLLHFHIGSQ 
IPS 

NtADCl 

TAGTGAAGAAGCTGGAAGAATCCGGAATGCTGGATTG 
CCTTCAGTTGCTGCATTTTCA.CATTGGATCTCAGATC 
CCTTCA 1355 
NtADC2 

.G A 

T 

T 1359 

284 ********** 
************* 

* * * 



309 TALLADGVGE 
AAQIYCEL IRLGA 
G M K F 

NtADCl 

ACGGCGTTGCTTGCTGATGGTGTTGGTGAGGCTGCTC 
AGATTTATTGTGAATTAATCCGTCTTGGTGCGGGTAT 
GAAGTT 1435 
NtADC2 

. . . . G A A C 

G A 

1439 

310 *G******** 
********V**** 
* * * * 



336 IDTGGGLGID 
YDGTKS CDSDVSV 
G Y G I 

NtADCl 

CATTGATACTGGAGGTGGGCTCGGAATTGATTATGAT 
GGTACTAAATCATGTGATTCAGATGTCTCTGTTGGCT 
ATGGCA 1515 
NtADC2 

T T 

C T 

1519 

337 **i******* 
************* 
* * * * 



60 
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363 QEYASTVVQA 
VQYVCDRKGVKHP 
VIC 

NtADCl 

TTCAAGAATACGCCTCCACAGTTGTCCAGGCGGTTCA 
ATATGTTTGCGAC CGTAAGGGCGTGAAGCACC CAGTG 
ATTTGC 1595 
NtADC2 

T G T 

A T A 

. .C. . . 1599 

364 *****^**** 
************* 
★ * * 

389 SESGRAIVSH 
HSILIFEAVSASS 
H S C S 
NtADCl 

AGCGAAAGTGGCAGGGCAATTGTTTCTCATCACTCAA 
TTCTGATTTTCGAAGCCGTGTCTGCTTCTAGTCACTC 
ATGTTC 1675 
NtADC2 



1679 

390 ********** 
************* 
* * * * 

416 SSHLSSGGLQ 
SMAETLNEDALAD 
Y R N L 



NtADCl 

TTCTTCACATCTGTCTTCTGGTGGCCTCCAATCCATG 
GCGGAGACGCTCAATGAAGATGCCCTTGCTGATTACC 
GCAATT 1755 
NtADC2 



C 

1759 

417 ********** 
************* 
* * * * 

443 SAAAVRGEYE 
TCVLYSDQLKQRC 
V D Q 
NtADCl 

TATCTGCTGCTGCAGTTCGTGGAGAGTACGAGACGTG 
TGTACTTTACTCTGATCAGTTGAAACAGAGATGTGTG 
GATCAG 1835 
NtADC2 

T A. . 



1839 

444 ********** 
************* 
* * * 

469 FKEGSLGIEH 
LAAVDS I CDFVSK 
A M G A 
NtADCl 

TTTAAAGAAGGGTCCTTGGGTATTGAACATCTTGCTG 
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CTGTTGATAGCATCTGTGATTTTGTATCAAAGGCTAT 

GGGGGC 1915 

NtADC2 



1919 

470 ********** 
************* 
* * * * 

496 ADPI RTYHVN 
LS IFTS IPDFWAF 
G Q L F 
NtADCl 

TGCTGATCCTATCCGCACTTACCATGTGAATCTGTCA 
ATTTTCACTTCAATTCCTGATTTTTGGGCCTTTGGTC 
AATTGT 1995 
NtADC2 

G 



1999 

497 ***v****** 
************* 
* * * * 

523 PIVPIHRLDE 
KPAVRGI LSDLTC 
DSD 
NtADCl 

TTCCGATTGTTCCAATACACCGTTTAGATGAAAAGCC 
TGCAGTAAGGGGAATATTATCGGACTTGACTTGTGAC 
AGTGAT 2075 



NtADC2 

T C 

G A 

2079 

524 ********** 
************* 
* * * 

549 GKVDKFIGGE 
SSLQLHELGSNGD 
G G G Y 
NtADCl 

GGGAAGGTTGATAAGTTCATTGGTGGCGAATCAAGCT 
TGCAGCTGCATGAATTGGGAAGTAATGGCGATGGTGG 
TGGGTA 2155 
NtADC2 



. . .C. . .A 

. . . T . . 2159 

550 ********** 
***p********* 
* * * * 

576 YLGMFLGGAY 
EEALGGLHNLFGG 
P S V V 
NtADCl 

TTATCTGGGGATGTTTTTGGGTGGGGCTTATGAGGAG 
GCGCTCGGAGGACTCCACAACCTGTTTGGTGGACCAA 
GCGTGG 2235 
NtADC2 



WO 00/67558 



PCT/US00/12450 



13 / 21 



.T. .C. 2239 

577 ********** 

************* 

* * * * 

603 RVVQSDSAHS 
FAMSRSVPGPSCA 
D V L 

NtADCl 

TGCGCGTGGTGCAGAGCGATAGCGCTCACAGCTTCGC 
CATGTCTCGCTCCGTCCCTGGCCCGTCCTGCGCGGAC 
GTGCTC 2315 
NtADC2 

T. . 

. . . .A T T.T. 

2319 

604 ********** 

***T********* 

* * * 

629 RAMQHEPELM 
FETLKHRAEEFLE 
Q E E D 
NtADCl 

CGAGCGATGCAGCACGAGCCCGAGCTCATGTTCGAGA 
CTCTCAAGCAC CGTGCGGAGGAATTCTTGGAACAAGA 
AGAAGA 2395 
NtADC2 



. . .T. . 2399 



630 ********** 
************* 
* * p * 

656 KGLAIASLAS 
S LAQS FHNMPYLV 
A P A S 
NtADCl 

CAAAGGGCTGGCCATTGCATCTTTGGC CAGCAGCTTA 
GCTCAGTCCTTCCATAACATGCCTTACCTTGTGGCGC 
CTGCAT 2475 
NtADC2 

TG. . .A G. . 



. .T. . . 2479 

657 ****VE**** 
*V*********** 

* * g * 

683 CCFTAVTANN 
GGYNYYYSDENAA 
D S A 

NtADCl 

CTTGCTGCTTCACTGCAGTTACTGCTAACAACGGTGG 
CTATAACTACTATTACAGTGATGAGAATGCAGCAGAT 
TCTGCT 2555 
NtADC2 

C T.C A T 

T 

2559 

684 *R***A*D** 
************* 

* * * 



WO 00/67558 



14 / 21 



PCT/USOO/12450 



709 TGEDE IWSYC 
T A *** 

NtADCl 

ACAGGGGAGGATGAGATTTGGTCCTATTGCACTGCTT 
GAagtgttgtcgtagcatctccagttttagtttgtcg 
tcgaag 2635 
NtADC2 



. . . .g. 2639 

710 ******** 
* * * * *★* 

720 
NtADCl 

ttgtctgtttttgaataatacccttagttggtgatgt 
ttttct 



T 



GA 



c 



2678 



NtADC2 



aataata 



2682 
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N. sylvaatrla 



M« »ylve»tri» 



MPAUQCCVZMTVS PPLGYAF 8 AD38 LP APEFFT9GV9PTKSAAG8XG 

K?M^CCVDAAVa»rFOTSrLWDSBLPAPEITMOVrP9THTAVATTTTT 



HPV1ACCVCAA — APFGYAFA0DX8FFAFXAFT-«VPPATADO-TMf8NN 



3FDL33ALYOVT»IGAFTr!rVNS>«DISVKrHGTDT^ 

HWSrl^SAI.ySZDOIKAFrrTVIM8ODI8VXrHSTDTUH0BZDiajCVV 

IW8f*HB8Al.T»II)g W0A PTT T V Wa SODI>VKlWaTPTIJW0»IDLJJtVV 



IOtA3DPlW8GOUSLQLPLVVPJFWLIOflU««>aATD^ 
JOUWDflOfLiKiljCLOrPLVVKrPDIlJCHVLXJLOWrDTAVOaOOYlAHY 



f 

21 



100 
100 




145 

150 
130 



L. •seulantu* 



A. Mtiv« 

l. eoll 



N. «ylva»tria 

L. MCUltRtHB 

A. thallwu 
«. MX 

A. aatlvo 

B. coll 



L. tieiamtia 
A. tMllu* 



AZVSHK9X U r EAVSA33-HSCS S3HL8SGGLOSHM T LWCDALADY AHL 442 
AXVSHKSVl^rBAVS9TT-TA>-50KUSVCI«SFVKlUMDOMUU)riUIL 445 
AXV8HH3VLXFEAV8STT-TA — SOEIJSVTLQSFVIIU^DAAADYRNL 445 

AXVSHKSVLXr EAVSADK- FX — VHQAT FGDXQF 1XEG- 
NXEAAAKYXDL 434 

ATV8KH9VLIFKAVOT3B-TN-OGGAPPAL9AHYLAriL9KI>- YGYL 



8AAAVA BE YE TCVLY8DQLKQACVP—CrKEG8 LGIEMLAAVD- - 

SZCD 487 

8AAAIPa3EYZm^LYAIX}UCQ1lCVS^KDG0LZ)ZC0LAAVO> CZCO 490 

SAAAZAGCY1>TCVLYADgLIQ0KCVX^FIU»0LDZE0LAAVD> GICD 490 

YAAVKAOI«SCLtYVD0LlC0RCVX-GFKX<3VUZX0iASVD -GLCI 479 

8ELArAGDYXTCLVYTKIiaOUU7yE'OFKaGTVCKXQLAAVE CUT I 492 

i»V-— — KXKKHSXE-KYXLGK K 417 

431 



YHVXLSzr Tsz rorHArooLf n 

YMVN1<SZFTSVFDFWAXDQL7FX 

FV8 KAXGASDPVRT — — — YHVML9ZFT9VPDFUAXDQLFFI 

WVLXAXGASDPVHT 

YMZNLSVrrsZPOtMOZDOLrPZ 51* 

LVPJCAVGAAB8VAA-. YHVNLSXFTSVPDAWGIXQVT FX 

LSKSVTTDAKTZYN YHKMLSVF S LHPDYWGXCKLF Ftf 

EV0KOLOF0MAAKAF1 IDE WWUMADTOfYVHTS LTQSHFDAWQ IDQLF FV 



524 

327 
327 



31f 
434 

507 



A. Mtlv* 
S. eoll 



A. Mtlv* 
E. eoll 



M. aylvootrls 
L. MOulntiB 



A. Mtlra 
E. eoll 



E0ATSAZ»1«K>KJ^ffiaLAPrDVIJ0gaN9UITArANAZICYTayO8VY 
ETEEAO- -OQRLPALTCF PQX LOHALABZWAAT EHAAE8 YOYH S DY 



195 

200 
200 



0GVYPVKCN0I«FVvTDXVKFO8GFArCUAa8KPEUJJUfSCUni68m 
OGVY FA^(CN0DAFVVKDXVKFO8er8PeUAa8RPELLLAKSC LCK6SKE 



G9FD 190 



GLLVCWGFIO»rriaiJU.VAIliataJCI^^ 245 
Cl>LVCNGFKDAEYI8LAt<VAAKIJUjNTVXVLE0EEELDLVXDX8KXMAVR 230 
CU.VX^FI03AEYIBIAI^AA]UJU>rrVIVLXCCXXU>LVZDI8iaOAVA 230 




rsaaiJJUOJlTEH»qHFQ8T8U8JUiilJ ULTTYOXVP.WAKUOGGH 

LDCL 290 

FVICl^AAl*T»ISaKFQOIFAAFJ»FITJLTTA*Vli*VV»llJ>l*<»fLOCL 



OLUCrKZOSOZP8TA14AX« 
Q LLHFHX G8Q I P8TALLAD 

OAHHXVIDIGOGLG 33t 




N. eylvsetrls 



K. oylvMtrla 




OGGDGGK - YY LOKF LGGA YIEALGG UWLFOGFSV LAV SOS DS PKSFAVT 



GR-YTUOTLG0AYXEAL0aVIMLra«P8VV«V9O8I)OFKSFAVT (01 
- — EGGRYYYLQKFMGAYXEALGCWHNLrGOPSVVAVSgSDGPKSFAVT 

G YYVAVUtTGKYQEALSNKWILFGGFSLVKVVGTGNOGAFNVE 

KLGFFIfVGAYQE X LGMOOfLFGDTEAVPVBVF POO- 8VEVE 



C0I 
544 
391 




VGAE8AAAEEE LWFY-CVA 



721 
734 
133 



713 

VFAAQDL 
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LA Mutant 



Wild-type 




p-ATPase 




12 24 hr after topping 



'5 



ADC1 AOC2 



root leaf ftovw 



12 12 1 2 





1.1 kb 


1.1 kb 












0.46 kb 
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-135 -36 
ODC2 CCTACCCCTT CAACAGCTAT TTCTCTAAAA AAAAAAAAAA AAGAAGAAAA TACTACGTAG ATTACACAAT ATT AT CAGT A GTAGTATCAC TTTTCGTCCC 

ODC1 AAAGTGAGTT TCACCCAATA TGAGCGTGTG AAAAGCCCAA AAAAACAGTT TTTTTTTATT TTTTTTTATT TCCTCCAAAA AACACATTTT AAGGTATTTT 

-3 5 TATA box +1* * * * 65 

ODC2 Tcg»l*§§AT GATAAACATT TTTAGAGGTT TCCCCQTCTC AAAGGGAACA AGAOAAACAT TCAT AT TATT GAATCCCTAG TTTCTTTTCT TTCCCTTTGA 

ODC1 TAAGCACATT TTGCTCCTTT CCTTTCCGGC CTGGTATTCO ATTCCTTTAA CAATGOCTTC AACCATGGTG GCAACCTATT CTCGTTTTCT TTCCCTTTGA 

66 165 
ODC2 TTCCTTCCTC TCATTTACCT CTCTCTTTTC TTCCTTTGTT TGGATGGCCG GCCAAACAAT CATC G TTTCC GGGTTGAACC CGGCGGCCAT TCTTCAGTCC 

ODC1 TTCCTTCCTC TCATTTACCT CTATCTTTTC TTACTTTGTT TGGATGGTCG GCCAAACAAT CATCGTTTCC GGGTTGAACC CAGCGGCCAT TCTTCAGTOC 

PODC2 MA GQTI 1VS GLN PAAI LQS 

pODCl V C 

166 265 
ODC2 AC AATT GGCG GCGGAGCTTC TCCTACAGCG GCGGCGGCGG CGGAAAACGG CACCAGAAAA GTCATCCCTC TCTCAAGAGA TGC CTTACAA GATTTCATGT 

ODC1 AC AATT GGCG GCGGAGCTTC TCCTACAGCG GCGGCGGCGG CGGAAAACGA CACCAGAAAA GTCATCCCTC TTTCAAGAGA TGCCCTACAA GATTTCATGT 

pODC2 TIG GGAS PTA AAA AENG TRK VIP L S R D ALQ DFM 

pOOCl D 

266 365 

ODC2 TATCAATCAT AACCCAAAAA TTACAAGATG AGAAACAACC TTTTTACGTG CTAGACTTGG GTGAGGTTGT TTCTCTTATG GACCAATGGA AATCTGCTCT 

ODC1 TATCAATCAT AACTCAAAAA TTACAAGATG AGAAACAACC TTTTT ACGTG CTAGATTTGG GTGAGGTTGT TTCTCTTATG GACCAATGGA AATATGCTCT 

pODC2 L5II TQK LQD EKQP FYV L D L GEVV SLM DQW KSAL 

pODCl Y 

366 465 
ODC2 CCCAAATATC CGTCCATTTT ACGCTGTTAA ATGTAACCCT GAACCGTCGT TCCTTTCAAT TTTATCTGCT ATGGGCTCAA ATTTTGATTG TGCTAGCCGA 

ODC1 CCCAAATATC CGTCCATTTT ACGCTGTTAA ATGTAACCCT GAATCGTTGT TCCTTTCAAT TTTATCTGCT ATGGGCTCAA ATTTTGATTG TGCTAGCCGA 

pOOC2 PNI EPF YAVK C N P EPS F L S I LSA NGS NFDC ASR 

pODCl S L 

466 565 
ODC2 GCTGAAATTG AGTATGTTTT ATCTCTTGGC ATTTCACCTO ACCGTATTGT TTTCGCAAAT CCATOCAAAC CGGAATCCGA TATT A TTTTT GCAGCAAAAG 

ODC1 GCTGAAATTG AGTATGTTTT ATCTCTTGGC AATAAOAAAA GAGAGAGGTC AATGGGTTAC TTGATTTGAT GAAAGTTTGG GAAATTAATA TTGGGGTTGT 

PODC2 AEI EYVL SLG ISP D R I V FAN PCK PESD IIF A A K 

pODCl NKK RERS MGY LI. 

566 665 
ODCZ TTGGGGTGAA TCTTACAACC TATGATTCTG AAGACGAGGT TTACAAGATC CGAAAGCATC ACCCGAAATC CGAACTCTTG CTCCGCATCA AGCCCATGCT 

ODC1 CTTCGTATCG TCATGGGAAT CTTTAGCTGA AGTTATAACA AATTTGGAGG AGTTTCTCTT AAAAATTTGG ATTAAAATCG TGCTTGGAAC AAGAACACAC 

PODC2 VGVN LTT YDS EDEV YKI RKH HPKS ELL LRI KPML 

666 765 

ODC2 CGACGGCAAC GCGAGATGCC CAATGGGCCC GAAATACGGC GCGCTTCCAG AAGAAGTCGA CCCGCTGCTC CGGGCAGCTC AAGCCGCCCG TCTCACCGTA 

ODC1 ATGAATAAAG CGAAGAACAC CAAGACCACT GATTTCCAAA ACACCAAATT TCA A TTTTTT TAAAC G TTTT CTTTCTTGGT TGGGTGTAAA TTAAGCTTTT 

pODC2 DGN ARC PHGP KYG ALP EEVD PLL R A A Q A A R LTV 

766 865 
ODC2 TCCGGCGTCT CATTCCACAT CGGTAGCGGA GAT GCCG ATT CAAACGCTTA TCTCGGCGCC ATAGCCGCGG CTAAGGAAGT GTTTGAAACA GCTGCTAAAC 

ODC1 CTTTTCTTTT TTAGAATGTT ATTTTTATTT TATTTATTAA ATAGATTTAA CATAGTTTTT TTTACTCAAA ATAATATATG TCATTTTTTT ATTCGTCACT 

pODC2 SGV SFHI G5G DAD SNAY LGA I A A AKEV FET A A K 

866 965 

ODC2 TCGGGATGTC GAAAATGACT GTTCTAGACG TCGGCGGCGG GTTTACATCC GGCCACCAGT TCACAACCGC CGCCGTCGCC GTTAAATCAG CTTTAAAACA 

ODC1 CGCCACGTCA GCAGCGAGTG CATTGCACAA ACTTTGTAAG TTTGGCTGAT TGTTAAATAA GTGCTCAATT GGAACAAAGT TCATGTAACG TATGTGCTCA 

pODC2 LGMS KMT VLD VGGG FTS GHQ FTTA AVA VKS ALKQ 

966 1065 

ODC2 ACACTTCGAT GACGAACCGG AGTTGACAAT CATAGCTGAA CCGGGTCGGT TTTTTGCAGA GACGGCGTTT ACTTTGGCAA CGACGATTAT AGGGAAAAGA 

ODC1 ATAGGAACTC TCTTAAGTTT AGGTGTCTAA ATGAAAGATC GTGCCAACTT TAAGTGTCTC C GTATGT ATT CAGCCAAAAT AATGTAAGCC AAATGTAGTC 

pODC2 HFD DEP ELTI I A E PGR FFAE TAF TLA TTII GKR 

1066 1165 

ODC2 GTGAGGGGTG AATTGAGGGA GT AT TGGATT AACGACGGGC TGTACGGTTC GATGAACTGT GTACTTTACG ACCATGCGAC GGTGAATGC A ACGCCGTTAG 

ODC1 AATAAAGCGA TCGTGCTAGA ACCACGGGAC TCAGGGAATG CCTTACACCT TCTCCCCGGT CAACAGAATT CCTTACTCGG AGTTTGTTTT CGAAGACCAA 

pODC2 VRG ELRE YWI MDG LYGS MNC VLY DMAT V N A TPL 

1166 1265 
ODC2 CTGTTCTGTC GAATCGTAGT AACGTTACCT GCGGCGGGTC GAAAACGTTT CCGACGACTG TGTTTGGGCC CACTTGTGAT GCTCTTGATA CTGTTTTAAG 

ODC1 TAATAATAGA GTGAAACCTT CCTTTGAATA GGGATTCAAA AAAAAGGTGA CTTGGAACAC CAGCAAAAAT TAATTCCTAG TGGCGACACT GTAAATAAAA 

pODC2 AVLS NRS NVT CGGS KTF PTT VFGP TCD ALD TVLR 

1266 1365 
ODC2 GGATTACCAG TTACCGGAGC TGCAGGTTAA TGATTGGCTG GTTTTTCCTA ATATGGGTGC TTATACTAAA GCTGCTGGGT CCAATTTTAA TGGATTTAAT 

ODC1 TAATCCCTAT TTCAAATTTG TCACTTTAAT TGGAAAAACT CTTTCACCCA CAATCCATAA CAAC AC AT TA TCTTTTGGAG GTGTAAAAAG GTGATGTGAC 

pODC2 DYQ LPE LQVN DWL VFP NMGA YTK A A G SNFN GFN 

1366 1465 

ODC2 ACTTCCGCCA TTGTTACTCA CCTCGCTTAT TCTTATCCAA GCTGATGAAC CACCTGTATT AGGAATTACT ACCGTGGTTT TGATGGTTTT TTCCTTTTTT 

ODC1 AGCTCTAGCA ACTCTGCTGG GGGCTATTAA TAAGAATTCG AGCTTTGTAT ATTGATTTTT ATTTGGCTTT TATCATGTCT TGGATATTAT TGTGTTTGGG 

pODC2 TSA IVTH LAY SYP S. 

1466 Po ly A sign al 1565 

ODC2 GGGTATCTTT TTTTTAATTT TGTTGTTTTT GGTAGTAATT TATATTCCAA ATCAGCTTGT AATTCTCTTG TATGCCj ^^^^^ TGCAAGG ATTTGCTAAT 

ODC1 AGCATAATGT TCTATTTGTC TCTTATTTAT CGCTTTAATA GTTATTTAAA CTGTGATATA AATTGTATCC TATCTCGCAC CCCTCTGAGT CTTCTGATAG 

1566 4-Poly A site 1665 

ODC2 TGTGATTTTC TCTAATATGG AAGTTTTTAA AATTAGTTTA AGAAACATAA TGGGTAAAAG GTTTGTGGGG TCATGATATT TGTGTGACTA TAAAAGCATC 

ODC1 GTAGTTATGT TGTGTTTGCC TACCAGCATC ATAATATTTC TGTCTTGAGA TAAAGCCAGT TAGCCTACCA GCTTTTGGTG AAGGATTTAA TCACATATGT 
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SEQUENCE LISTING 



<110> Timko P, Michael 



<120> Regulation of Gene Expression in Tabacco for 

Manipulation of Plant Growth and Secondary Metabolism 



<130> 4981*239 



<140> 
<141> 

<160> 26 



<170> Patentln Ver. 2.0 



<210> 1 
<211> 1120 
<212> DNA 
<213> Plant 



<400> 1 

ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 

aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 

catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 

tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 

caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 

agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 

gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 

aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 

tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 

aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 

gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 

agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 

cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 

agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 

tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 

ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 

ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 

ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 
atactttctc ttctccaatt tgtttagttt aattttgaaa 



1120 



<210> 2 
<211> 3091 
<212> DNA 
<213> Plant 



<400> 2 
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ctgagttgac aagaacaatt cctggtgaat 
aatctataac caaagcagct ggttgagtga 
catttgtgga atttggggcc atttcaaagg 
tcaaattaaa ataaggctta gcgttaaaat 
caatgcttct gaggacagta gtaaaaacaa 
agcttgagga taaagtatgt cattagtttt 
gttgaagtcc ctatttatag ctatacacag 
aaatgacaat aatggggtta atgatgaata 
tccgcaacga ctatttattt aatattgagg 
aagcattcgt ttgcttccgt tgattacgtt 
gccgttgtcc ttgatcttcg ctttcattta 
agtcatgcac ccattcaatt atttaatgga 
cattcgtcaa atactttact tggatataaa 
agaaagaaag cagacgatta aagaaatttt 
tgtactaata aaattagggt actactttac 
ttagttccaa aatgtctcgg gcagtccaac 
ttatatcgag ttgcgccctc cactcctcgg 
ctattgggag tgtacatcaa gctttcataa 
atactttctc ttctccaatt tgtttagttt 
cacaaatggc tctaccatct tcaagagtgg 
cacttccaaa caccaaaacg gccacaagaa 
cagccttgat aatggcaacg agctactggg 
ttcagagttt agcgcattat ggccaggtta 
aaagttaaaa ttgttaggct aatataagga 
ggaaaaagta tcaaataaat tcaaaaaatg 
taatttgaaa taaatcgaat tttgcaggtg 
tgttccaggg gaagtctgac taccaagatg 
tacacatgct tccatttaaa ttgatacttt 
gtacagtcag caacttatgg gaaggttctg 
aatggtggat ttccatacac tgaaatgatt 
ccaaaaaagg ttttgatcat cggcggagga 
tatcctacaa tcgaaaaaat tgacattgtt 
caaacttctt ttactcacat aaaaaaatgg 
gaatactatt tttttaaaac aaaattttct 
tatctcgctg ctaattttaa cgatcctcgt 
ttgataatct cgcttttgtt ttatctttta 
tgtgtggtta attcacctgc cattggttct 
ctgcacaagc agaatattat gatgctatta 
tattacttct taataccaag actaatctta 
tttctaaaac aatataattt caggtccagc 
ggcagtagct aaagccctaa ggccaggagg 
gcttcatatg catattatta agcaaatcat 
tgtcaactat gcttggacta ctgttccaac 
ttcctataaa attggaagtt ttgattctat 
gaaaaaccaa cttcttttct tttactcttc 
atatgatcaa ttattttgat ttcagcggtg 
gaccagaaat tgacttcaag aatccagtaa 
agtccaaatt agcacctctc aagttctaca 



cagatggatg aagataatag aggtgggtgg 60 
ctgtgcgagt tgcagaaaca attgaagggt 120 
aaaaagaaaa gatgacttag cattaataaa 180 
caaaggaaat ggcaagcctg gctcctggag 240 
tatcagacaa aaagtaaagt tgtattattt 300 
gtgagagatt tggtgtcctc tacaatgatt 360 
gaaacaaaat cctaggatca agcccctctt 420 
tgtagcggca tgacatgaat gccaaaattc 480 
aatatttttt attaaatact atctggtgac 540 
gattttggga tctactctat accaaccgaa 600 
attcatcttc cgtctgcctc cgatttcaca 660 
aaccaatttt accctataca aatggtacat 720 
caattttgcc cgaggagtaa acagatgcga 78 0 
taaaaaagga gagagaaatg aacacacaca 840 
taataattgg acagagacta aattcatatt 900 
catgcacgtt gtaatgattt tttaactcta 960 
tgtccaaatt gtatataaat gcatatgtgt 1020 
agtacaaatc gtaatacttg ttgaaacata 1080 
aattttgaaa atggaagtca tatctaccaa 1140 
tgccattccc atgaatggcc accataatgg 1200 
tgggacttcc gaacaacaga acgggacaat 1260 
aaactccaat tgtattaagc ctggttggtt 1320 
gtactgagaa agaaactcaa atgcatattt 1380 
gttgatattc ttttagtgat taattaaaaa 1440 
gatagtaact tcgcatatta ctctacacat 1500 
aagcattctc acttaaggtt gagaagttac 1560 
tcatgctctt tgaggtaaat aatattttaa 1620 
taatttactt ttactttatt gcatgtgtac 1680 
actttggatg gagcaattca acacacagag 1740 
gttcatcttc cacttggttc catcccaaac 1800 
attggtttta cattattcga aatgcttcgt 1860 
gagatcgatg acgtggtagt tgatgtaagt 1920 
tttagattgc ttcttgttat ttttctaaaa 1980 
tttttacagg tatctagaaa atttttccct 2040 
gtaaccctag tccttggaga tggtgcgtat 2100 
tttttattgc atttaatttt taccttttgg 2160 
ctttcatttc aggggctgca tttgtaaagg 2220 
tagtggactc ttctgatccc attggtactc 2280 
ttgaataagc tactaataaa cggtaattga 2340 
aaaagatttg tttgagaggc cattctttga 2400 
agttgtatgc acacaggctg aaagcatttg 2460 
tgctaactgt cgtcaagtct ttaagggctc 2520 
atatccaacg tatttttctc tctctctctc 2580 
aattgtcaag aaatggagaa tcagttccaa 2640 
aaggtattgt gtttaatttt ttttcaactg 2700 
tgattggtta tatgctctgc tctactgaag 2760 
atccaattga caaagagaca gctcaagtca 2820 
actctgatgt aacttcatat ctcacaattt 2880 
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cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940 
gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000 
tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060 
aataagtgaa agacgtattg tatgagaatt c 3091 

<210> 3 
<211> 353 
<212> PRT 
<213> Plant 

<400> 3 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr lie Phe Lys Ser 
15 10 15 

Gly Ala lie Pro Met Asn Gly His His Asn Gly Thr Ser Lys His Gin 
20 25 30 

Asn Gly His Lys Asn Gly Thr Ser Glu Gin Gin Asn Gly Thr lie Ser 
35 40 45 

Leu Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Cys lie Lys Pro 
50 55 60 

Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser 
65 70 75 80 

Leu Lys Val Glu Lys Leu Leu Phe Gin Gly Lys Ser Asp Tyr Gin Asp 
85 90 95 

Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp 
100 105 110 

Gly Ala lie Gin His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met 
115 120 125 

lie Val His Leu Pro Leu Gly Ser lie Pro Asn Pro Lys Lys Val Leu 
130 135 140 

lie lie Gly Gly Gly lie Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr 
145 150 155 160 

Pro Thr lie Glu Lys lie Asp lie Val Glu lie Asp Asp Val Val Val 
165 170 175 

Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Asn Asp 
180 185 190 

Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala 
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195 200 205 

Ala Gin Ala Glu Tyr Tyr Asp Ala lie lie Val Asp Ser Ser Asp Pro 
210 215 220 

lie Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val 
225 230 235 240 

Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gin Ala Glu Ser 
245 250 255 

lie Trp Leu His Met His lie lie Lys Gin lie lie Ala Asn Cys Arg 
260 265 270 

Gin Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr 
275 280 285 

Tyr Pro Thr Gly Val lie Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro 
290 295 300 

Glu lie Asp Phe Lys Asn Pro Val Asn Pro lie Asp Lys Glu Thr Ala 
305 310 315 320 

Gin Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp lie 
325 330 335 

His Lys Ala Ala Phe lie Leu Pro Ser Phe Ala Arg Ser Met lie Glu 
340 345 350 

Ser 



<210> 4 
<211> 711 
<212> DNA 
<213> Plant 

<400> 4 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 
aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 
aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 
tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 
gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 
aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 
agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 
tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 
actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 
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acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 

tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 

tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa a 711 

<210> 5 
<211> 3129 
<212> DNA 
<213> Plant 

<400> 5 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 

aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 

aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 

tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 

gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 

aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 

agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 

tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 

actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 

acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 

tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 

tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 

atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 

caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 

aacgggcacc agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 

caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 

attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 

aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 

actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 

cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 

gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 

tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 

atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 

atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 

tcccaaaccc aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 

tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 

atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 

tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 

caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 

ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 

tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 

gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 

cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 

aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 

ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 

taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 

taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 

aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 
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gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt 
attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca 
tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat 
taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata 
atagtcaaat aaatgttgtg accatataaa aacctccgca tttttaagat 
agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt 
aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat 
ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg 
tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa 
caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca 
aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc 
actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc 
ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg 
cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg 
ggagaattc 

<210> 6 

<211> 375 

<212> PRT 

<213> Plant 

<400> 6 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr lie Phe Lys Asn 
15 10 15 

Gly Ala lie Pro Met Asn Gly His Gin Asn Gly Thr Ser Glu His Leu 
20 25 30 

Asn Gly Tyr Gin Asn Gly Thr Ser Lys His Gin Asn Gly His Gin Asn 
35 40 45 

Gly Thr Phe Glu His Arg Asn Gly His Gin Asn Gly Thr Ser Glu Gin 
50 55 60 

Gin Asn Gly Thr lie Ser His Asp Asn Gly Asn Glu Leu Leu Gly Ser 
65 70 75 80 

Ser Asp Ser lie Lys Pro Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp 
85 90 95 

Pro Gly Glu Ala Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gin Gly 
100 105 110 

Lys Ser Asp Tyr Gin Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly 
115 120 125 

Lys Val Leu Thr Leu Asp Gly Ala lie Gin His Thr Glu Asn Gly Gly 
130 135 140 



agatcctaaa 2340 
tttactattc 2400 
ttttatattt 2460 
aaaaagattt 2520 
cataagtttc 2580 
atgtcataaa 2640 
tgtgtttaat 2700 
tgatcggtta 2760 
atccaattga 2820 
actctgatgt 2880 
aaattttata 2940 
attttaccat 3000 
gtagtacaat 3060 
acgtattgta 3120 
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Phe Pro Tyr Thr Glu Met lie Val His Leu Pro Leu Gly Ser lie Pro 
145 150 155 160 

Asn Pro Lys Lys Val Leu lie lie Gly Gly Gly lie Gly Phe Thr Leu 
165 170 175 

Phe Glu Met Leu Arg Tyr Pro Ser lie Glu Lys lie Asp lie Val Glu 
180 185 190 

lie Asp Asp Val Val Val Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu 
195 200 205 

Ala Ala Asn Phe Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly 
210 215 220 

Ala Ala Phe Val Lys Ala Ala Gin Ala Gly Tyr Tyr Asp Ala lie lie 
225 230 235 240 

Val Asp Ser Ser Asp Pro lie Gly Pro Ala Lys Asp Leu Phe Glu Arg 
245 250 255 

Pro Phe Phe Glu Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val 
260 265 270 

Cys Thr Gin Ala Glu Ser lie Trp Leu His Met His lie lie Lys Gin 
275 280 285 

lie He Ala Asn Cys Arg Gin Val Phe Lys Gly Ser Val Asn Tyr Ala 
290 295 300 

Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly Val He Gly Tyr Met Leu 
305 310 315 320 

Cys Ser Thr Glu Gly Pro Glu Val Asp Phe Lys Asn Pro Val Asn Pro 
325 330 335 

lie Asp Lys Glu Thr Thr Gin Val Lys Ser Lys Leu Gly Pro Leu Lys 
340 345 350 

Phe Tyr Asn Ser Asp He His Lys Ala Ala Phe He Leu Pro Ser Phe 
355 360 365 

Ala Arg Ser Met He Glu Ser 
370 375 



<210> 7 
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<211> 1134 
<212> DNA 
<213> Plant 



<400> 7 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 
ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 
acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 
gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 
agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 
tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 
aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 
aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 
gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 
ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 
caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 
atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 
tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 
gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 
cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 
tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 
atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 
gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 
aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaa 1134 



<210> 8 
<211> 3269 
<212> DNA 
<213> Plant 



<400> 8 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 

ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 

acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 

gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 

agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 

tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 

aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 

aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 

gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 

ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 

caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 

atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 

tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 

gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 

cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 

tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 

atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 
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gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 
aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 
gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 
ggttaccaga atggcacttc caaacaccaa aacggccacc agaatggcac ttccgaacat 1260 
cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 
gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 
ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 
ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 
taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 
attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 
aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 
atcgaatttt gcaggtgaag cattctcact taaggttgag aagttactat tccaggggaa 1740 
gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 
atgtatgatt atatttaaag tggtattttt cgtttatttt taatttattg cacgtgtacg 1860 
tacagtcagc aacatatggg aaggttctga ctttggatgg agcaattcaa cacacagaga 1920 
atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 
ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 
atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 
aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 
acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 
agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 
ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 
gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 
tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 
gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 
taatttcagg tccagcaaaa gacttgtttg agaggccatt ctttgaggca gtagccaaag 2580 
ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 
ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 
ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 
ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 
cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 
tcagcggtgt gattggttat atgctctgtt ctactgaagg accagaagtt gacttcaaga 2940 
atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 
agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 
ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 
agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 
atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 
gtgaaatgcg acgtattgta taagaattc 3269 

<210> 9 

<211> 381 

<212> PRT 

<213> Plant 

<400> 9 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr He Phe Lys Asn 
15 10 15 
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Gly Ala lie Pro Met Asn Gly Tyr Gin Asn Gly Thr Ser Lys His Gin 
20 25 30 

Asn Gly His Gin Asn Gly Thr Ser Glu His Arg Asn Gly His Gin Asn 
35 40 45 

Gly lie Ser Glu His Gin Asn Gly His Gin Asn Gly Thr Ser Glu His 
50 55 60 

Gin Asn Gly His Gin Asn Gly Thr lie Ser His Asp Asn Gly Asn Glu 
65 70 75 80 

Leu Gin Leu Leu Gly Ser Ser Asn Ser lie Lys Pro Gly Trp Phe Ser 
85 90 95 

Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser Leu Lys Val Glu 
100 105 110 

Lys Leu Leu Phe Gin Gly Lys Ser Asp Tyr Gin Asp Val Met Leu Phe 
115 120 125 

Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp Gly Ala lie Gin 
130 135 140 

His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met lie Val His Leu 
145 150 155 160 

Pro Leu Gly Ser lie Pro Asn Pro Lys Lys Val Leu lie lie Gly Gly 
165 170 175 

Gly lie Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr Pro Thr lie Glu 
180 185 190 

Lys lie Asp lie Val Glu lie Asp Asp Val Val Val Asp Val Ser Arg 
195 200 205 

Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Ser Asp Pro Arg Val Thr 
210 215 220 

Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala Ala Gin Ala Gly 
225 230 235 240 

Tyr Tyr Asp Ala lie lie Val Asp Ser Ser Asp Pro lie Gly Pro Ala 
245 250 255 

Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val Ala Lys Ala Leu 
260 265 270 
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Arg Pro Gly Gly Val Val 
275 

Met His lie lie Lys Gin 
290 

Gly Ser Val Asn Tyr Ala 
305 310 

Val He Gly Tyr Met Leu 
325 

Lys Asn Pro Val Asn Pro 
340 

Lys Leu Ala Pro Leu Lys 
355 

Phe He Leu Pro Ser Phe 
370 



Cys Thr Gin Ala Glu 
280 

He He Ala Asn Cys 
295 

Trp Thr Thr Val Pro 
315 

Cys Ser Thr Glu Gly 
330 

He Asp Lys Glu Thr 
345 

Phe Tyr Asn Ser Asp 
360 

Ala Arg Ser Met He 
375 
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Ser He Trp Leu His 
285 

Arg Gin Val Phe Lys 
300 

Thr Tyr Pro Thr Gly 
320 

Pro Glu Val Asp Phe 
335 

Thr Gin Val Lys Ser 
350 

He His Lys Ala Ala 
365 

Glu Ser 
380 



<210> 10 
<211> 469 
<212> DNA 
<213> Plant 

<400> 10 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 

ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 

cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 

gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 

attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 

ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 

catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 

tgaaacataa tactttctct tctccaattt gtttagttta attttgaaa 469 

<210> 11 

<211> 3001 

<212> DNA 

<213> Plant 

<400> 11 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 

ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 

cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 

gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 

attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 
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ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 
catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 
tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 
atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540 
ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 
cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 
ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 720 
cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 780 
gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 
ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc caggttagta 900 
ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 
gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 
ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 
agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 
catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 
aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 
ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 
ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 
ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 
aaatcgatga cgtggtagtt gatgtaagtc aaatttcttt tactcacata aaaaaatgat 1500 
ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 
aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 
gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 
atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgtcatt 1740 
ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 
ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 1860 
atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 
tccagcaaaa gatttgtttg agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980 
aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 
aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 
tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 
gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 
aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280 
aaactttatt tattatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 
tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 
tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460 
taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 
gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 
actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 
tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700 
actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760 
ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 
tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 
gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 
aagatcgagt cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 
c ~ 3001 

<210> 12 
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<211> 419 
<212> PRT 
<213> Plant 



<400> 12 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr lie Phe Lys Asn 
15 10 15 

Gly Ala lie Pro Met Asn Gly His Gin Ser Gly Thr Ser Lys His Leu 
20 25 30 

Asn Gly Tyr Gin Asn Gly Thr Ser Lys His Gin Asn Gly His His Asn 
35 40 45 

Gly Thr Ser Glu His Arg Asn Gly His Gin Asn Gly lie Ser Glu His 
50 55 60 

Gin Asn Gly His Gin Asn Gly Thr Ser Glu His Arg Asn Gly His Gin 
65 70 75 80 

Asn Gly lie Ser Glu His Gin Asn Gly His Gin Asn Gly Thr Ser Glu 
85 90 95 

His Gin Asn Gly His Gin Asn Gly Thr Ser Glu Gin Gin Asn Gly Thr 
100 105 110 

lie Ser His Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Ser lie 
115 120 125 

Lys Leu Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala 
130 135 140 

Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gin Gly Lys Ser Asp Tyr 
145 150 155 160 

Gin Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr 
165 170 175 

Leu Asp Gly Ala He Gin His Thr Glu Asn Gly Gly Phe Pro Tyr Thr 
180 185 190 

Glu Met He Val His Leu Pro Leu Gly Ser lie Pro Asn Pro Lys Lys 
195 200 205 

Val Leu lie lie Gly Gly Gly He Gly Phe Thr Leu Phe Glu Met Leu 
210 215 220 

Arg Tyr Pro Thr lie Glu Lys He Asp He Val Glu He Asp Asp Val 
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240 



Val Val Asp Val Ser Arg Lys Ser Phe Pro Tyr Leu Ala Ala Asn Phe 
245 250 255 

Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val 
260 265 270 

Lys Ala Ala Gin Ala Gly Tyr Tyr Asp Ala lie lie Val Asp Ser Ser 
275 280 285 

Asp Pro lie Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu 
290 295 300 

Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gin Ala 
305 310 315 320 

Glu Ser lie Trp Leu His Met His lie lie Lys Gin lie lie Ala Asn 
325 330 335 

Cys Arg Gin Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val 
340 345 350 

Pro Thr Tyr Pro Thr Gly Val lie Gly Tyr Met Leu Cys Ser Thr Glu 
355 360 365 

Gly Pro Glu Val Asp Phe Lys Asn Pro lie Asn Pro lie Asp Lys Glu 
370 375 380 

Thr Thr Gin Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser 
385 390 395 400 

Asp lie His Lys Ala Ala Phe lie Leu Pro Ser Phe Ala Arg Ser Met 
405 410 415 

lie Glu Ser 



<210> 13 
<211> 1636 
<212> DNA 
<213> Plant 

<400> 13 

ggcacgagat cagatccaat tctcttctgt gcttcccttc tctgctctca aattcttcag 60 

atctacaaag ttttcttcat tttcagaggg cagacatgga aactttcttg ttcacctcag 120 

agtcagtcaa tgaaggccac cccgacaagc tctgcgacca ggtctcggat gcaattcttg 180 
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atgcttgctt agaacaggat ccagaaagca aggttgcatg tgaaacctgc acaaagacaa 240 
acatggttat ggtctttgga gagatcacaa ccaaggccac tgttgactat gagaagatag 300 
tgcgtgacac atgcagaggc attgggttca cctcagcaga tgttggcctt gacgctgaca 360 
actgcaaggt tcttgtcaac atcgagcagc agagccctga cattgcccaa ggtgttcacg 420 
gtcatcttac caagaaacca gaagagattg gagctggtga ccaaggtcac atgtttggct 480 
atgccactga tgaaacccca gagctcatgc cccttaccca tgtttgggcc actaagcttg 540 
gtgccaagct taccgaagtg aggaagaaca agacttgccc atggctcaga ccagatggca 600 
agacccaagt tactgttgag tacaagaacg acaatggtgc catggtccca attagagttc 660 
acactgttct catctcaact caacatgacg aaactgtcac aaacgaccag attgcccagg 720 
acttgaaaga gcatgtgatc aaacctgtga tcccatctca gtaccttgat gagaatacca 780 
tcttccacct caacccatca ggtcgcttcg tcatcggtgg accacacgga gatgctggac 840 
ttaccggcag gaaaattatc attgacacct acggaggctg gggtgcccat ggaggaggtg 900 
ctttctcagg aaaggaccct actaaggtgg acaggagtgg tgcttatatt gttagacagg 960 
cagcaaagag tgtggtcgcc tcaggacttg ctcgccgctg tattgtgcag gtttcttatg 1020 
ctatcggtgt ggctgaacca ctttccgtgt ttgttgacac ttacaagact ggaacaattc 1080 
cagacaagga tattttgact ctgatcaagg agaactttga cttcaggcct ggaatgatgt 1140 
caatcaacct tgacttgtta agaggaggca acttcaggta ccagaagact gcagcttatg 1200 
gtcactttgg ccgtgatgac cccgacttct catgggagac tgtcaaggtc ctcaagccaa 1260 
aagcttaagt gaggtgtagc cttttggcca ttatttttct tgcagaccaa taaacaagct 1320 
tcatcatatc atgcattggt ggcaggagaa gagaatttgt gtctccattg gaggattcta 138 0 
tgagctctga gtcattgaac attgttattt ttctttcttt ttttttcacc cttttctgca 1440 
gtaccttatt tttattttgt tactgttaag tagcagtgat ttaagttttc cctgttaagt 1500 
agcagtgatt taagttttcc ctgttaagta gctggaatta agtttccatg ttctatcata 1560 
ttatatgtga acttgtcaat tatctcctga ggtgaaagag tccttcaggg aatagtttaa 1620 
aaaaaaaaaa aaaaaa 1636 

<210> 14 
<211> 390 
<212> PRT 
<213> Plant 

<400> 14 

Met Glu Thr Phe Leu Phe Thr Ser Glu Ser Val Asn Glu Gly His Pro 
15 10 15 

Asp Lys Leu Cys Asp Gin Val Ser Asp Ala lie Leu Asp Ala Cys Leu 
20 25 30 

Glu Gin Asp Pro Glu Ser Lys Val Ala Cys Glu Thr Cys Thr Lys Thr 
35 40 45 

Asn Met Val Met Val Phe Gly Glu lie Thr Thr Lys Ala Thr Val Asp 
50 55 60 

Tyr Glu Lys lie Val Arg Asp Thr Cys Arg Gly lie Gly Phe Thr Ser 
65 70 75 80 

Ala Asp Val Gly Leu Asp Ala Asp Asn Cys Lys Val Leu Val Asn lie 
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85 90 95 

Glu Gin Gin Ser Pro Asp lie Ala Gin Gly Val His Gly His Leu Thr 
100 105 110 

Lys Lys Pro Glu Glu lie Gly Ala Gly Asp Gin Gly His Met Phe Gly 
115 120 125 

Tyr Ala Thr Asp Glu Thr Pro Glu Leu Met Pro Leu Thr His Val Trp 
130 135 140 

Ala Thr Lys Leu Gly Ala Lys Leu Thr Glu Val Arg Lys Asn Lys Thr 
145 150 155 160 

Cys Pro Trp Leu Arg Pro Asp Gly Lys Thr Gin Val Thr Val Glu Tyr 
165 170 175 

Lys Asn Asp Asn Gly Ala Met Val Pro lie Arg Val His Thr Val Leu 
180 185 190 

lie Ser Thr Gin His Asp Glu Thr Val Thr Asn Asp Gin lie Ala Gin 
195 200 205 

Asp Leu Lys Glu His Val lie Lys Pro Val lie Pro Ser Gin Tyr Leu 
210 215 220 

Asp Glu Asn Thr lie Phe His Leu Asn Pro Ser Gly Arg Phe Val He 
225 230 235 240 

Gly Gly Pro His Gly Asp Ala Gly Leu Thr Gly Arg Lys He lie He 
245 250 255 

Asp Thr Tyr Gly Gly Trp Gly Ala His Gly Gly Gly Ala Phe Ser Gly 
260 265 270 

Lys Asp Pro Thr Lys Val Asp Arg Ser Gly Ala Tyr He Val Arg Gin 
275 280 285 

Ala Ala Lys Ser Val Val Ala Ser Gly Leu Ala Arg Arg Cys He Val 
290 295 300 

Gin Val Ser Tyr Ala He Gly Val Ala Glu Pro Leu Ser Val Phe Val 
305 310 315 320 

Asp Thr Tyr Lys Thr Gly Thr He Pro Asp Lys Asp He Leu Thr Leu 
325 330 335 

lie Lys Glu Asn Phe Asp Phe Arg Pro Gly Met Met Ser He Asn Leu 
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350 



Asp Leu Leu Arg Gly Gly Asn Phe Arg Tyr Gin Lys Thr Ala Ala Tyr 
355 360 365 

Gly His Phe Gly Arg Asp Asp Pro Asp Phe Ser Trp Glu Thr Val Lys 
370 375 380 

Val Leu Lys Pro Lys Ala 
385 390 



<210> 15 
<211> 1596 
<212> DNA 
<213> Plant 

<400> 15 

ggcacgaggg gaacaagaga aacatcatat 
ttgattcctt cctctcattt acctctctct 
caatcatcgt ttccgggttg aacccggcgg 
cttctcctac agcggcggcg gcggcggaaa 
gagatgcctt acaagatttc atgttatcaa 
aaccttttta cgtgctagac ttgggtgagg 
ctctcccaaa tatccgtcca ttttacgctg 
caattttatc tgctatgggc tcaaattttg 
ttttatctct tggcatttca cctgaccgta 
ccgatattat ttttgcagca aaagttgggg 
aggtttacaa gatccgaaag catcacccga 
tgctcgacgg caacgcgaga tgcccaatgg 
tcgacccgct gctccgggca gctcaagccg 
acatcggtag cggagatgcc gattcaaacg 
aagtgtttga aacagctgct aaactcggga 
gcgggtttac atccggccac cagttcacaa 
aacaacactt cgatgacgaa ccggagttga 
cagagacggc gtttactttg gcaacgacga 
gggagtattg gattaacgac gggctgtacg 
cgacggtgaa tgcaacgccg ttagctgttc 
ggtcgaaaac gtttccgacg actgtgtttg 
taagggatta ccagttaccg gagctgcagg 
gtgcttatac taaagctgct gggtccaatt 
ctcacctcgc ttattcttat ccaagctgat 
gttttgatgg ttttttcctt ttttgggtat 
aatttatatt ccaaatcagc ttgtaattct 
taattgtgat tttctctaaa aaaaaaaaaa 

<210> 16 
<211> 433 



tattgaatcc ctagtttctt ttctttccct 60 
tttcttcctt tgtttggatg gccggccaaa 120 
ccattcttca gtccacaatt ggcggcggag 180 
acggcaccag aaaagtcatc cctctctcaa 240 
tcataaccca aaaattacaa gatgagaaac 300 
ttgtttctct tatggaccaa tggaaatctg 360 
ttaaatgtaa ccctgaaccg tcgttccttt 420 
attgtgctag ccgagctgaa attgagtatg 480 
ttgttttcgc aaatccatgc aaaccggaat 540 
tgaatcttac aacctatgat tctgaagacg 600 
aatccgaact cttgctccgc atcaagccca 660 
gcccgaaata cggcgcgctt ccagaagaag 720 
cccgtctcac cgtatccggc gtctcattcc 780 
cttatctcgg cgccataacc gcggctaagg 840 
tgtcgaaaat gactgttcta gacgtcggcg 900 
ccgccgccgt cgccgttaaa tcagctttaa 960 
caatcatagc tgaaccgggt cggttttttg 1020 
ttatagggaa aagagtgagg ggtgaattga 1080 
gttcgatgaa ctgtgtactt tacgaccatg 1140 
tgtcgaatcg tagtaacgtt acctgcggcg 1200 
ggcccacttg tgatgctctt gatactgttt 1260 
ttaatgattg gctggttttt cctaatatgg 1320 
ttaatggatt taatacttcc gccattgtta 1380 
gaaccacctg tattaggaat tactaccgtg 1440 
ctttttttta attttgttgt ttttggtagt 1500 
cttgtatgcc ataagaatgc aaggatttgc 1560 
aaaaaa 1596 
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<212> PRT 
<213> Plant 

<400> 16 

Met Ala Gly Gin Thr lie lie Val Ser Gly Leu Asn Pro Ala Ala lie 
15 10 15 

Leu Gin Ser Thr lie Gly Gly Gly Ala Ser Pro Thr Ala Ala Ala Ala 
20 25 30 

Ala Glu Asn Gly Thr Arg Lys Val lie Pro Leu Ser Arg Asp Ala Leu 
35 40 45 

Gin Asp Phe Met Leu Ser lie lie Thr Gin Lys Leu Gin Asp Glu Lys 
50 55 60 

Gin Pro Phe Tyr Val Leu Asp Leu Gly Glu Val Val Ser Leu Met Asp 
65 70 75 80 

Gin Trp Lys Ser Ala Leu Pro Asn lie Arg Pro Phe Tyr Ala Val Lys 
85 90 95 

Cys Asn Pro Glu Pro Ser Phe Leu Ser lie Leu Ser Ala Met Gly Ser 
100 105 110 

Asn Phe Asp Cys Ala Ser Arg Ala Glu lie Glu Tyr Val Leu Ser Leu 
115 120 125 

Gly lie Ser Pro Asp Arg lie Val Phe Ala Asn Pro Cys Lys Pro Glu 
130 135 140 

Ser Asp lie lie Phe Ala Ala Lys Val Gly Val Asn Leu Thr Thr Tyr 
145 150 155 160 

Asp Ser Glu Asp Glu Val Tyr Lys lie Arg Lys His His Pro Lys Ser 
165 170 175 

Glu Leu Leu Leu Arg lie Lys Pro Met Leu Asp Gly Asn Ala Arg Cys 
180 185 190 

Pro Met Gly Pro Lys Tyr Gly Ala Leu Pro Glu Glu Val Asp Pro Leu 
195 200 205 

Leu Arg Ala Ala Gin Ala Ala Arg Leu Thr Val Ser Gly Val Ser Phe 
210 215 220 

His He Gly Ser Gly Asp Ala Asp Ser Asn Ala Tyr Leu Gly Ala He 
225 230 235 240 
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Ala Ala Ala Lys Glu Val Phe Glu Thr Ala Ala Lys Leu Gly Met Ser 
245 250 255 

Lys Met Thr Val Leu Asp Val Gly Gly Gly Phe Thr Ser Gly His Gin 
260 265 270 

Phe Thr Thr Ala Ala Val Ala Val Lys Ser Ala Leu Lys Gin His Phe 
275 280 285 

Asp Asp Glu Pro Glu Leu Thr lie lie Ala Glu Pro Gly Arg Phe Phe 
290 295 300 

Ala Glu Thr Ala Phe Thr Leu Ala Thr Thr lie lie Gly Lys Arg Val 
305 310 315 320 

Arg Gly Glu Leu Arg Glu Tyr Trp lie Asn Asp Gly Leu Tyr Gly Ser 
325 330 335 

Met Asn Cys Val Leu Tyr Asp His Ala Thr Val Asn Ala Thr Pro Leu 
340 345 350 

Ala Val Leu Ser Asn Arg Ser Asn Val Thr Cys Gly Gly Ser Lys Thr 
355 360 365 

Phe Pro Thr Thr Val Phe Gly Pro Thr Cys Asp Ala Leu Asp Thr Val 
370 375 380 

Leu Arg Asp Tyr Gin Leu Pro Glu Leu Gin Val Asn Asp Trp Leu Val 
385 390 395 400 

Phe Pro Asn Met Gly Ala Tyr Thr Lys Ala Ala Gly Ser Asn Phe Asn 
405 410 415 

Gly Phe Asn Thr Ser Ala lie Val Thr His Leu Ala Tyr Ser Tyr Pro 
420 425 430 

Ser 



<210> 17 
<211> 2074 
<212> DNA 
<213> Plant 

<400> 17 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60 
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catcatccac caagtcaaag ttcggaaatt catatcgttt ctcatcatct tctatccgag 120 
aaatgagggg actatctgta tacggtcaaa accgagtctg cccttcatat gactaatcga 180 
gattagaaca taatggtcta aggttcatca ttataataac gagccatgat atagagttag 240 
gttgtcaagc tcaagcccca gagagcgatc aatatcgaga tcgagccaag gttaaactcg 300 
agacctagag accgatcaat accgagaccg accaagtcaa ggtcgagctc gagacccaga 360 
gaccggtcaa gattgagatc ggccaagatc gagatcgagc caagaaatta aaaagtcgtt 420 
atagccgcat ttagggagag aatctctgcg gaaatcacga cttgaatcag ggaaaaacta 480 
attaattaat ctatcatgtg atccccacta tgtattttta attatactca aaatgggatt 540 
cccccactat attaagagtg gttatcattt gtaatggaga gacatacaca cattcattct 600 
gacatataca gaaaatagag caaatactat ccttttttgg cttttgatat ttagtcatat 660 
tgtttcttct acccattgtt cttcactcaa tttggaggtg ataaaacttg aaggtttaag 720 
ctaactagtc cattcgggtt gcattcattt cttttacaat aatttcgtca tcatttattt 780 
attttctcaa ttgtactaag ttataccacg tatttttaga actgcgtata aattcaactc 840 
tatccatttt tcgggtaaac accgaataca tagcacaata gcaccctcaa ttgcaaaagt 900 
ccaaagccaa gggttcattc ctttctgaag aaatgagata gagaattgaa aatctaattt 960 
agttatctaa atctttataa tttagccttc catataagaa aaaggaaaca aattaactga 1020 
agaacaatag cctcgcatag atttaccttc tccatataaa ttttgtttat actcaatttt 1080 
tttgcaaatg tgtctaaaat gataggactt gcaaattttt atttaacatt tcctactcct 1140 
ctttaatttt caagaaatta attttaagca ttctcgattt gctctgcccg ctccgtcccg 1200 
ttgccatctc tgactcggat aggacctcga ttgcaaaaat ccaaacccaa ggaaccttcc 1260 
atacattaca taagccacaa aatagtaact attaaaaact accaatatat cctcaaatac 1320 
tcgcgattat ttcataccta acacgtttac cttatcttct cgtaatgacg ctacattagt 1380 
tagtgatata aaataccgaa tttaccacgc ggcaaccctc cgctgtctat ccacggcccg 1440 
agagaatctc ttagcccccc aaatacgaaa attaacttct agaattttat tttctggtta 1500 
ttaccatgaa aataaagaaa aagagaaaag tcaagaaatt taattgggct aatactgggg 1560 
tccactgccc agccacgcat ttccctccta tataaagcgt cgtcacctct catgcaaatc 1620 
tcgctcactt cacagttgtt agtttcacgt tctcttctca attcccataa aagaaaccct 1680 
tccgttaggt ttccgtccta ttttctcttc ttctacgctt cctcttctga tatcaatatc 1740 
tgtatggtgt ttttcttgtt cgaattttag atttgttttg cctttaatac ctgtaacctt 1800 
ataattctct gtttaaacca aaaacttagc ttcttctgaa gtcagggtgg ggatatttgg 1860 
atcgtgtaag agtgtgttag aaggtgatta tcttttgatt cagttccttt tttgcttctt 1920 
ttgagggggt agccggggcc tcggcctcgg cgggttttaa tagcccccat ctattacaac 1980 
cattgggcaa aaacatcatt aaatctgtac aaagcaaacc cttaatttag tttaattttc 2040 
tgtattcttt gattctttaa cagaagaaga agag 2074 



<210> 18 
<211> 4321 
<212> DNA 
<213> Plant 



<400> 18 

tggtaactgg accgacgcga catttgtcgt 

catcatccac caagtcaaag ttcggaaatt 

aaatgagggg actatctgta tacggtcaaa 

gattagaaca taatggtcta aggttcatca 

gttgtcaagc tcaagcccca gagagcgatc 

agacctagag accgatcaat accgagaccg 

gaccggtcaa gattgagatc ggccaagatc 



atatgtctta atcgggctag tcgctgacaa 60 
catatcgttt ctcatcatct tctatccgag 120 
accgagtctg cccttcatat gactaatcga 180 
ttataataac gagccatgat atagagttag 240 
aatatcgaga tcgagccaag gttaaactcg 300 
accaagtcaa ggtcgagctc gagacccaga 360 
gagatcgagc caagaaatta aaaagtcgtt 420 
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atagccgcat ttagggagag aatctctgcg 
attaattaat ctatcatgtg atccccacta 
cccccactat attaagagtg gttatcattt 
gacatataca gaaaatagag caaatactat 
tgtttcttct acccattgtt cttcactcaa 
ctaactagtc cattcgggtt gcattcattt 
attttctcaa ttgtactaag ttataccacg 
tatccatttt tcgggtaaac accgaataca 
ccaaagccaa gggttcattc ctttctgaag 
agttatctaa atctttataa tttagccttc 
agaacaatag cctcgcatag atttaccttc 
tttgcaaatg tgtctaaaat gataggactt 
ctttaatttt caagaaatta attttaagca 
ttgccatctc tgactcggat aggacctcga 
atacattaca taagccacaa aatagtaact 
tcgcgattat ttcataccta acacgtttac 
tagtgatata aaataccgaa tttaccacgc 
agagaatctc ttagcccccc aaatacgaaa 
ttaccatgaa aataaagaaa aagagaaaag 
tccactgccc agccacgcat ttccctccta 
tcgctcactt cacagttgtt agtttcacgt 
tccgttaggt ttccgtccta ttttctcttc 
tgtatggtgt ttttcttgtt cgaattttag 
ataattctct gtttaaacca aaaacttagc 
atcgtgtaag agtgtgttag aaggtgatta 
ttgagggggt agccggggcc tcggcctcgg 
cattgggcaa aaacatcatt aaatctgtac 
tgtattcttt gattctttaa cagaagaaga 
cgctactgtt tcccctcctc tcggctatgc 
ggagttcttt acctccggcg tacctcctac 
ggatctgtcc tctgctttgt acggggtcga 
ctctaacgga gatatctccg tccgaccaca 
tgaccttctc aaggtcgtga aaaaggcctc 
tcagctgcct cttgttgttc gcttccctga 
atcggctttt gatctcgctg ttcattccca 
tcccgtgaaa tgcaatcaag acaggttcgt 
attccggttc gggttggaag ctgggtctaa 
ctgcaggggc agtgctgagg gccttctcgt 
ttcgcttgct ttggttgcaa gaaagctcat 
ggaggagctt gaccttgtga ttgatataag 
acttcgggct aagctcagga ccaagcattc 
aggtaagttt gggcttacaa cgacccaaat 
cggaatgctg gattgccttc agttgctgca 
ggcgttgctt gctgatggtg ttggtgaggc 
tggtgcgggt atgaagttca ttgatactgg 
taaatcatgt gattcagatg tctctgttgg 
tgtccaggcg gttcaatatg tttgcgaccg 
cgaaagtggc agggcaattg tttctcatca 



gaaatcacga cttgaatcag ggaaaaacta 480 
tgtattttta attatactca aaatgggatt 540 
gtaatggaga gacatacaca cattcattct 600 
ccttttttgg cttttgatat ttagtcatat 660 
tttggaggtg ataaaacttg aaggtttaag 720 
cttttacaat aatttcgtca tcatttattt 780 
tatttttaga actgcgtata aattcaactc 840 
tagcacaata gcaccctcaa ttgcaaaagt 900 
aaatgagata gagaattgaa aatctaattt 960 
catataagaa aaaggaaaca aattaactga 1020 
tccatataaa ttttgtttat actcaatttt 1080 
gcaaattttt atttaacatt tcctactcct 1140 
ttctcgattt gctctgcccg ctccgtcccg 1200 
ttgcaaaaat ccaaacccaa ggaaccttcc 1260 
attaaaaact accaatatat cctcaaatac 1320 
cttatcttct cgtaatgacg ctacattagt 1380 
ggcaaccctc cgctgtctat ccacggcccg 1440 
attaacttct agaattttat tttctggtta 1500 
tcaagaaatt taattgggct aatactgggg 1560 
tataaagcgt cgtcacctct catgcaaatc 1620 
tctcttctca attcccataa aagaaaccct 1680 
ttctacgctt cctcttctga tatcaatatc 1740 
atttgttttg cctttaatac ctgtaacctt 1800 
ttcttctgaa gtcagggtgg ggatatttgg 1860 
tcttttgatt cagttccttt tttgcttctt 1920 
cgggttttaa tagcccccat ctattacaac 1980 
aaagcaaacc cttaatttag tttaattttc 2040 
agagatgccg gccctaggtt gttgcgtaga 2100 
cttctctcgg gatagctctc ttcccgcgcc 2160 
aaactccgcc gccggttcca ttgggtctcc 2220 
tgggtgggga gctccttatt tctccgttaa 2280 
tggtacggac acactcccee accaggaaat 2340 
cgacccgaaa aattcagggg ggctcgggct 2400 
tgtgctaaaa aaccggttgg aatctctgca 2460 
gggctatggg gcccactacc aaggtgttta 2520 
ggtggaagat attgtcaaat tcgggtcgtc 2580 
acccgagctc ctgttagcca tgagctgtct 2640 
ttgcaatggt ttcaaggacg ctgagtacat 2700 
gttaaacact gtaattgttc ttgaacaaga 2760 
ccgtaagatg gctgttcggc ccgtaattgg 2820 
aggccatttt ggatccactt ctggagaaaa 2880 
tgttcgtgta gtgaagaagc tggaagaatc 2940 
ttttcacatt ggatctcaga tcccttcaac 3000 
tgctcagatt tattgtgaat taatccgtct 3060 
aggtgggctc ggaattgatt atgatggtac 3120 
ctatggcatt caagaatacg cctccacagt 3180 
taagggcgtg aagcacccag tgatttgcag 3240 
ctcaattctg attttcgaag ccgtgtctgc 3300 



21 



WO 00/67558 



PCT/US00/12450 



ttctagtcac tcatgttctt cttcacatct gtcttctggt ggcctccaat ccatggcgga 3360 
gacgctcaat gaagatgccc ttgctgatta ccgcaattta tctgctgctg cagttcgtgg 3420 
agagtacgag acgtgtgtac tttactctga tcagttgaaa cagagatgtg tggatcagtt 3480 
taaagaaggg tccttgggta ttgaacatct tgctgctgtt gatagcatct gtgattttgt 3540 
atcaaaggct atgggggctg ctgatcctat ccgcacttac catgtgaatc tgtcaatttt 3600 
cacttcaatt cctgattttt gggcctttgg tcaattgttt ccgattgttc caatacaccg 3660 
tttagatgaa aagcctgcag taaggggaat attatcggac ttgacttgtg acagtgatgg 3720 
gaaggttgat aagttcattg gtggcgaatc aagcttgcag ctgcatgaat tgggaagtaa 3780 
tggcgatggt ggtgggtatt atctggggat gtttttgggt ggggcttatg aggaggcgct 3840 
cggaggactc cacaacctgt ttggtggacc aagcgtggtg cgcgtggtgc agagcgatag 3900 
cgctcacagc ttcgccatgt ctcgctccgt ccctggcccg tcctgcgcgg acgtgctccg 3960 
agcgatgcag cacgagcccg agctcatgtt cgagactctc aagcaccgtg cggaggaatt 4020 
cttggaacaa gaagaagaca aagggctggc cattgcatct ttggccagca gcttagctca 4080 
gtccttccat aacatgcctt accttgtggc gcctgcatct tgctgcttca ctgcagttac 4140 
tgctaacaac ggtggctata actactatta cagtgatgag aatgcagcag attctgctac 4200 
aggggaggat gagatttggt cctattgcac tgcttgaagt gttgtcgtag catctccagt 4260 
tttagtttgt cgtcgaagtt gtctgttttt gaataatacc cttagttggt gatgtttttc 4320 
t 4321 

<210> 19 

<211> 720 

<212> PRT 

<213> Plant 

<400> 19 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu 
15 10 15 

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe 
20 25 30 

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser lie Gly Ser 
35 40 45 

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro 
50 55 60 

Tyr Phe Ser Val Asn Ser Asn Gly Asp lie Ser Val Arg Pro His Gly 
65 70 75 80 

Thr Asp Thr Leu Pro His Gin Glu lie Asp Leu Leu Lys Val Val Lys 
85 90 95 

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gin Leu Pro 
100 105 110 

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu 
115 120 125 
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Gin Ser Ala Phe Asp 
130 

Tyr Gin Gly Val Tyr 
145 

Glu Asp lie Val Lys 
165 

Gly Ser Lys Pro Glu 
180 

Ser Ala Glu Gly Leu 
195 

lie Ser Leu Ala Leu 
210 

Val Leu Glu Gin Glu 
225 

Lys Met Ala Val Arg 
245 

Lys His Ser Gly His 
260 

Gly Leu Thr Thr Thr 
275 

Ser Gly Met Leu Asp 
290 

Gin lie Pro Ser Thr 
305 

Gin lie Tyr Cys Glu 
325 

Asp Thr Gly Gly Gly 
340 

Asp Ser Asp Val Ser 
355 

Val Val Gin Ala Val 
370 



Leu Ala Val His Ser Gin 
135 

Pro Val Lys Cys Asn Gin 
150 155 

Phe Gly Ser Ser Phe Arg 
170 

Leu Leu Leu Ala Met Ser 
185 

Leu Val Cys Asn Gly Phe 
200 

Val Ala Arg Lys Leu Met 
215 

Glu Glu Leu Asp Leu Val 
230 235 

Pro Val lie Gly Leu Arg 
250 

Phe Gly Ser Thr Ser Gly 
265 

Gin lie Val Arg Val Val 
280 

Cys Leu Gin Leu Leu His 
295 

Ala Leu Leu Ala Asp Gly 
310 315 

Leu lie Arg Leu Gly Ala 
330 

Leu Gly lie Asp Tyr Asp 
345 

Val Gly Tyr Gly lie Gin 
360 

Gin Tyr Val Cys Asp Arg 
375 



Gly Tyr Gly Ala His 
140 

Asp Arg Phe Val Val 
160 

Phe Gly Leu Glu Ala 
175 

Cys Leu Cys Arg Gly 
190 

Lys Asp Ala Glu Tyr 
205 

Leu Asn Thr Val lie 
220 

lie Asp lie Ser Arg 
240 

Ala Lys Leu Arg Thr 
255 

Glu Lys Gly Lys Phe 
270 

Lys Lys Leu Glu Glu 
285 

Phe His lie Gly Ser 
300 

Val Gly Glu Ala Ala 
320 

Gly Met Lys Phe lie 
335 

Gly Thr Lys Ser Cys 
350 

Glu Tyr Ala Ser Thr 
365 

Lys Gly Val Lys His 
380 
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Pro Val lie Cys Ser Glu Ser Gly Arg Ala lie Val Ser His His Ser 
385 390 395 400 

lie Leu lie Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser 
405 410 415 

Ser His Leu Ser Ser Gly Gly Leu Gin Ser Met Ala Glu Thr Leu Asn 
420 425 430 

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg 
435 440 445 

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gin Leu Lys Gin Arg 
450 455 460 

Cys Val Asp Gin Phe Lys Glu Gly Ser Leu Gly lie Glu His Leu Ala 
465 470 475 480 

Ala Val Asp Ser lie Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala 
485 490 495 

Asp Pro lie Arg Thr Tyr His Val Asn Leu Ser lie Phe Thr Ser lie 
500 505 510 

Pro Asp Phe Trp Ala Phe Gly Gin Leu Phe Pro lie Val Pro lie His 
515 520 525 

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly lie Leu Ser Asp Leu Thr 
530 535 540 

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe lie Gly Gly Glu Ser Ser 
545 550 555 560 

Leu Gin Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr 
565 570 575 

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu 
580 585 590 

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gin Ser Asp 
595 600 605 

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys 
610 615 620 

Ala Asp Val Leu Arg Ala Met Gin His Glu Pro Glu Leu Met Phe Glu 
625 630 635 640 
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Thr Leu Lys His Arg Ala Glu Glu 
645 

Gly Leu Ala lie Ala Ser Leu Ala 
660 

Asn Met Pro Tyr Leu Val Ala Pro 
675 680 

Thr Ala Asn Asn Gly Gly Tyr Asn 
690 695 

Ala Asp Ser Ala Thr Gly Glu Asp 
705 710 



Phe Leu Glu Gin Glu Glu Asp Lys 
650 655 

Ser Ser Leu Ala Gin Ser Phe His 
665 670 

Ala Ser Cys Cys Phe Thr Ala Val 
685 

Tyr Tyr Tyr Ser Asp Glu Asn Ala 
700 

Glu He Trp Ser Tyr Cys Thr Ala 
715 720 



<210> 20 
<211> 2118 
<212> DNA 
<213> Plant 



<400> 20 

gaattcctta tccggatttc tggtacgcag 
tcgggattaa aattaggtga cttgggacac 
aaataaacaa atcccgtttc gattgtcctt 
gggtacggaa aaaggaggtg tacagcaatg 
ggaatcaact tgatcaaaat ttatgggtga 
ggactttagc agatgtggtc acttcaattt 
aaaaagtact agaaatttga gtcataaagc 
cttattaaat caaatgtact ttattaatgt 
gtcctcaccc cacaaaaagg agatagagaa 
ataaaattta tatccttgta taaatcccca 
tttaatcatc cgtataagaa agaagctaat 
aatagcactc tcaattacaa aaatccaaag 
gatagggaat ggaaaatata atttaattat 
agaaaaagga aacaaattaa ctgaagagca 
atgggtgggg aaaccgacaa accgcaccaa 
aaaattcgac tatggtttgg tttgatttgg 
ttggtttggt ttggttttaa ctaaagaaag 
catatataaa ttttttagat atatttaata 
aaatatttct taaaaatatt cataatttta 
ttttgaatgt ttcttactcc tctttaattt 
tgctccgccc ccgctccgtc ccgttgccat 
attgcaaaaa tccaaaccca aggaaccttc 



actgtaatat ggagtcatct tctcctcgat 60 
cctaaatctc ccaagtggcg actctgaaat 120 
aaattggaaa aaactccctt gtaccctccc 180 
acccaaaact tttattgcta tacattttga 240 
aattcaatgt ggtatgattt atattaggtc 300 
gcggcaaaaa taatgtacag ggataataat 360 
tttttcaatt ttacaaaaga tattaagata 420 
aatagcatga aaaaacagcc tcatccgcct 480 
aggaaactaa tcttatttaa ttttccacat 540 
aaaaaaaaaa atcaatacta attattttaa 600 
taactgactt acaaactgaa tagatagcac 660 
ccgagggtca ttcctttcat caagaaatta 720 
ctgaatcttt ataatttatc cttccatata 780 
tatagcctcg catagattta ccttctccat 840 
ttcgataatt cgagtcaaac tgaggaaaaa 900 
tttattgttg ggataaaaaa tcgatcataa 960 
tcaaaccgaa accaaacaaa cccgacatta 1020 
tataaatata cttgttgtga tgtaatttat 1080 
tcttttaaga tattatttcg tacttagaac 1140 
gttagaaatt aatttgaagg agtttgaatt 1200 
ccctgactca ataggataac agcaatctcg 1260 
ccaacattac ataagctaca aagtagagta 1320 
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gtttattaaa taactaccaa tatatcctca aattctcgcg attatttcat acctaacacg 1380 
cttaccttat cttctcgtaa tgacgctaca ttagttggtg atataaaata ccgaatttgc 1440 
cacgcggcaa tcctccgctg tctatccacg gcccgagaga atctcttagc cccccaaaga 1500 
tgaaaattaa cttctagaat tttattttct ggttattacc atgaaaataa ttaaataaaa 1560 
aaaaagagaa aagtaaagat atttaattgg gctaaaactg gggtccacgg cccagccacg 1620 
catttccctc ctatataaag cgtcgtcacc tctcatgcaa atctcgctca ctacacagtt 1680 
gttagtttca cgttctcttc tcaattccca taacagaaac ccttccgtta ggtttccgtc 1740 
ctatttttcc tcatcttctc cgtttcctct tctgaaatca atatctgtat ggtgtttttc 1800 
ttgttcgaat tttagatttg ttttgtcttt aatacctata accttaaatt ctctgtttaa 1860 
accaaaaact tagcttcttc tgaagtcagg gtggggattt ttggatcgtg taagagtgtg 1920 
ttagagggtg attatctttt gattcagttc cttttttgct tcttttgagg gggtagccgg 1980 
ggcctcggcc tcggcgggtt ttaatagccc ccatctatta caactattgg gcaaaaacat 2040 
cattaaatct gtacaaaaca aacccttaat ttagtttaat tttctgtatt cattgatttt 2100 
ttaacagaag aagaagag 2118 

<210> 21 

<211> 4368 

<212> DNA 

<213> Plant 

<400> 21 

gaattcctta tccggatttc tggtacgcag actgtaatat ggagtcatct tctcctcgat 60 
tcgggattaa aattaggtga cttgggacac cctaaatctc ccaagtggcg actctgaaat 120 
aaataaacaa atcccgtttc gattgtcctt aaattggaaa aaactccctt gtaccctccc 180 
gggtacggaa aaaggaggtg tacagcaatg acccaaaact tttattgcta tacattttga 240 
ggaatcaact tgatcaaaat ttatgggtga aattcaatgt ggtatgattt atattaggtc 300 
ggactttagc agatgtggtc acttcaattt gcggcaaaaa taatgtacag ggataataat 360 
aaaaagtact agaaatttga gtcataaagc tttttcaatt ttacaaaaga tattaagata 420 
cttattaaat caaatgtact ttattaatgt aatagcatga aaaaacagcc tcatccgcct 480 
gtcctcaccc cacaaaaagg agatagagaa aggaaactaa tcttatttaa ttttccacat 540 
ataaaattta tatccttgta taaatcccca aaaaaaaaaa atcaatacta attattttaa 600 
tttaatcatc cgtataagaa agaagctaat taactgactt acaaactgaa tagatagcac 660 
aatagcactc tcaattacaa aaatccaaag ccgagggtca ttcctttcat caagaaatta 720 
gatagggaat ggaaaatata atttaattat ctgaatcttt ataatttatc cttccatata 780 
agaaaaagga aacaaattaa ctgaagagca tatagcctcg catagattta ccttctccat 840 
atgggtgggg aaaccgacaa accgcaccaa ttcgataatt cgagtcaaac tgaggaaaaa 900 
aaaattcgac tatggtttgg tttgatttgg tttattgttg ggataaaaaa tcgatcataa 960 
ttggtttggt ttggttttaa ctaaagaaag tcaaaccgaa accaaacaaa cccgacatta 1020 
catatataaa ttttttagat atatttaata tataaatata cttgttgtga tgtaatttat 1080 
aaatatttct taaaaatatt cataatttta tcttttaaga tattatttcg tacttagaac 1140 
ttttgaatgt ttcttactcc tctttaattt gttagaaatt aatttgaagg agtttgaatt 1200 
tgctccgccc ccgctccgtc ccgttgccat ccctgactca ataggataac agcaatctcg 1260 
attgcaaaaa tccaaaccca aggaaccttc ccaacattac ataagctaca aagtagagta 1320 
gtttattaaa taactaccaa tatatcctca aattctcgcg attatttcat acctaacacg 1380 
cttaccttat cttctcgtaa tgacgctaca ttagttggtg atataaaata ccgaatttgc 1440 
cacgcggcaa tcctccgctg tctatccacg gcccgagaga atctcttagc cccccaaaga 1500 
tgaaaattaa cttctagaat tttattttct ggttattacc atgaaaataa ttaaataaaa 1560 
aaaaagagaa aagtaaagat atttaattgg gctaaaactg gggtccacgg cccagccacg 1620 
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catttccctc ctatataaag cgtcgtcacc 
gttagtttca cgttctcttc tcaattccca 
ctatttttcc tcatcttctc cgtttcctct 
ttgttcgaat tttagatttg ttttgtcttt 
accaaaaact tagcttcttc tgaagtcagg 
ttagagggtg attatctttt gattcagttc 
ggcctcggcc tcggcgggtt ttaatagccc 
cattaaatct gtacaaaaca aacccttaat 
ttaacagaag aagaagagat gccggcccta 
cctcctctca gctatgcctt ctctcgggat 
tccggcgtac ctccgacaaa ttctgccgct 
gctttatacg gggtcgatgg gtggggagct 
atctccgtcc gaccacacgg tacggacact 
gtcgtgaaaa aggcctccga cccgaaaaat 
gttgttcgct tccctgatgt gttgaaaaac 
ctcgcggttc attcccaggg ctatggggcc 
aatcaagaca ggttcgtggt ggaagatatc 
ttggaagccg ggtctaaacc cgagctcctg 
gctgagggcc ttctcgtttg caatggtttc 
gttgcaagaa agctcatgtt aaacactgta 
cttgtgattg atataagcca taagatggct 
ctcaggacca agcattcagg ccattttgga 
cttacaacga cccaaattgt tcgtgtggtg 
tgtcttcagt tgctgcattt tcacattgga 
gatggagttg gtgaggccgc tcagatttat 
aagttcattg atattggagg tgggcttgga 
tctgatgtct ctgttggcta tggcattcaa 
caatatgtat gcgaccgtaa gggtgtgaaa 
gcaattgttt ctcatcactc aattctgatt 
tgttcttctt cacatctgtc ttctggtggc 
gatgcccttg ctgattaccg caatttatct 
tgtgtacttt actctgatca gttgaaacag 
ttgggtattg aacatcttgc tgctgttgat 
ggggctgctg atcctgtccg cacttaccat 
gatttttggg cctttggtca attgtttccg 
cctgcagtga ggggaatatt atcggactta 
ttcattggtg gcgaatcaag cttgccgcta 
ggttattatc tggggatgtt tttgggtggg 
aacctgtttg gtggaccaag tgtcgtgcgc 
gccatgactc gctccgtccc tggcccgtct 
gagcccgagc tcatgttcga gactctcaag 
gatgacaaag ggctggctgt tgaatctttg 
atgccttacc ttgtggcgcc ttcatcttgc 
ggctataatt actattacag tgatgagaat 
atttggtcct attgcactgc ttgaagtgtt 
cgaggttgtc tgtttttgaa taataccctt 



tctcatgcaa atctcgctca ctacacagtt 1680 
taacagaaac ccttccgtta ggtttccgtc 1740 
tctgaaatca atatctgtat ggtgtttttc 1800 
aatacctata accttaaatt ctctgtttaa 1860 
gtggggattt ttggatcgtg taagagtgtg 1920 
cttttttgct tcttttgagg gggtagccgg 1980 
ccatctatta caactattgg gcaaaaacat 2040 
ttagtttaat tttctgtatt cattgatttt 2100 
ggttgttgtg tagatgctgc tgttgtttcc 2160 
agctctcttc ccgcgccgga gttctttgcc 2220 
gcttccattg ggtctccgga tttgtcgtct 2280 
ccttatttct ctgttaactc taatggagat 2340 
ctccctcacc aggaaattga ccttctcaag 2400 
tcaggtgggc ttgggcttca gctgcctctt 2460 
cggttggaat ctctgcaatc ggcttttgat 2520 
cactaccaag gtgtttatcc cgtgaaatgc 2580 
gtgaaattcg ggtcgccatt ccggttcggg 2640 
ttagccatga gctgtctctg caagggcagt 2700 
aaggacgctg agtacatttc gcttgctttg 2760 
attgtgcttg aacaagagga ggagcttgac 2820 
gttcggcctg taattggact tcgggctaag 2880 
tccacttctg gagaaaaagg taagtttggg 2940 
aagaagctag aagaatccgg aatgctggat 3000 
tctcagatcc cttctacggg gttgctagct 3060 
tgtgaattag tccgtcttgg agcgggtatg 3120 
attgattatg atggtactaa atcatgcgat 3180 
gaatatgcct ccgcagttgt tcaggcggtt 3240 
cacccagtga tctgcagcga aagtggcagg 3300 
ttcgaagccg tgtctgcttc tagtcactca 3360 
ctccaatcca tggcggagac gctcaacgaa 3420 
gctgctgcag ttcgtggaga gtatgagaca 3480 
agatgtgtgg atcagtttaa agaagggtcc 3540 
agcatctgtg attttgtatc aaaggctatg 3600 
gtgaatctgt caattttcac ttcaattcct 3660 
attgttccaa ttcaccgctt agatgaaaag 3720 
acttgtgaca gtgatgggaa ggttgataag 3780 
catgaattgg gaagtaatgg cgatggtggt 3840 
gcttatgagg aggcgctcgg aggactccac 3900 
gtggtgcaga gcgatagcgc tcacagcttt 3960 
tgcgctgatg tgctccgagc gatgcagcac 4020 
caccgtgcgg aggaattctt ggaacaagaa 4080 
gccagcagcg tagctcagtc cttccataac 4140 
cgcttcactg ctgctactga taacaatggt 4200 
gcagcagatt ctgctacagg ggaggatgag 4260 
ctcgtagcat ctccagtctt agtttgtcgt 4320 
agttggtgat gtttttct 4368 



<210> 22 
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<211> 721 
<212> PRT 
<213> Plant 

<400> 22 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Ala Val Val Ser Pro Pro 
15 10 15 

Leu Ser Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe 
20 25 30 

Phe Ala Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Ala Ser lie Gly 
35 40 45 

Ser Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala 
50 55 60 

Pro Tyr Phe Ser Val Asn Ser Asn Gly Asp lie Ser Val Arg Pro His 
65 70 75 80 

Gly Thr Asp Thr Leu Pro His Gin Glu lie Asp Leu Leu Lys Val Val 
85 90 95 

Lys Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gin Leu 
100 105 110 

Pro Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser 
115 120 125 

Leu Gin Ser Ala Phe Asp Leu Ala Val His Ser Gin Gly Tyr Gly Ala 
130 135 140 

His Tyr Gin Gly Val Tyr Pro Val Lys Cys Asn Gin Asp Arg Phe Val 
145 150 155 160 

Val Glu Asp lie Val Lys Phe Gly Ser Pro Phe Arg Phe Gly Leu Glu 
165 170 175 

Ala Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Lys 
180 185 190 

Gly Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu 
195 200 205 

Tyr lie Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val 
210 215 220 

lie Val Leu Glu Gin Glu Glu Glu Leu Asp Leu Val lie Asp lie Ser 
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His Lys Met Ala Val Arg Pro Val lie Gly Leu Arg Ala Lys Leu Arg 
245 250 255 

Thr Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys 
260 265 270 

Phe Gly Leu Thr Thr Thr Gin lie Val Arg Val Val Lys Lys Leu Glu 
275 280 285 

Glu Ser Gly Met Leu Asp Cys Leu Gin Leu Leu His Phe His lie Gly 
290 295 300 

Ser Gin lie Pro Ser Thr Gly Leu Leu Ala Asp Gly Val Gly Glu Ala 
305 310 315 320 

Ala Gin He Tyr Cys Glu Leu Val Arg Leu Gly Ala Gly Met Lys Phe 
325 330 335 

He Asp He Gly Gly Gly Leu Gly lie Asp Tyr Asp Gly Thr Lys Ser 
340 345 350 

Cys Asp Ser Asp Val Ser Val Gly Tyr Gly lie Gin Glu Tyr Ala Ser 
355 360 365 

Ala Val Val Gin Ala Val Gin Tyr Val Cys Asp Arg Lys Gly Val Lys 
370 375 380 

His Pro Val He Cys Ser Glu Ser Gly Arg Ala He Val Ser His His 
385 390 395 400 

Ser He Leu He Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser 
405 410 415 

Ser Ser His Leu Ser Ser Gly Gly Leu Gin Ser Met Ala Glu Thr Leu 
420 425 430 

Asn Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val 
435 440 445 

Arg Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gin Leu Lys Gin 
450 455 460 

Arg Cys Val Asp Gin Phe Lys Glu Gly Ser Leu Gly He Glu His Leu 
465 470 475 480 

Ala Ala Val Asp Ser lie Cys Asp Phe Val Ser Lys Ala Met Gly Ala 
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485 490 495 

Ala Asp Pro Val Arg Thr Tyr His Val Asn Leu Ser lie Phe Thr Ser 
500 505 510 

lie Pro Asp Phe Trp Ala Phe Gly Gin Leu Phe Pro lie Val Pro lie 
515 520 525 

His Arg Leu Asp Glu Lys Pro Ala Val Arg Gly lie Leu Ser Asp Leu 
530 535 540 

Thr Cys Asp Ser Asp Gly Lys Val Asp Lys Phe lie Gly Gly Glu Ser 
545 550 555 560 

Ser Leu Pro Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr 
565 570 575 

Tyr Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly 
580 585 590 

Leu His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gin Ser 
595 600 605 

Asp Ser Ala His Ser Phe Ala Met Thr Arg Ser Val Pro Gly Pro Ser 
610 615 620 

Cys Ala Asp Val Leu Arg Ala Met Gin His Glu Pro Glu Leu Met Phe 
625 630 635 640 

Glu Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gin Glu Asp Asp 
645 650 655 

Lys Gly Leu Ala Val Glu Ser Leu Ala Ser Ser Val Ala Gin Ser Phe 
660 665 670 

His Asn Met Pro Tyr Leu Val Ala Pro Ser Ser Cys Arg Phe Thr Ala 
675 680 685 

Ala Thr Asp Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn 
690 695 700 

Ala Ala Asp Ser Ala Thr Gly Glu Asp Glu lie Trp Ser Tyr Cys Thr 
705 710 715 720 

Ala 
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<210> 23 
<211> 2695 
<212> DNA 
<213> Plant 



<400> 23 

ttcacgttct cttctcaatt cccataaaag aaacccttcc gttaggtttc cgtcctattt 60 
tctcttcttc tacgcttcct cttctgatat caatatctgt atggtgtttt tcttgttcga 120 
attttagatt tgttttgcct ttaatacctg taaccttata attctctgtt taaaccaaaa 180 
acttagcttc ttctgaagtc agggtgggga tatttggatc gtgtaagagt gtgttagaag 240 
gtgattatct tttgattcag ttcctttttt gcttcttttg agggggtagc cggggcctcg 300 
gcctcggcgg gttttaatag cccccatcta ttacaaccat tgggcaaaaa catcattaaa 360 
tctgtacaaa gcaaaccctt aatttagttt aattttctgt attctttgat tctttaacag 420 
aagaagaaga gatgccggcc ctaggttgtt gcgtagacgc tactgtttcc cctcctctcg 480 
gctatgcctt ctctcgggat agctctcttc ccgcgccgga gttctttacc tccggcgtac 540 
ctcctacaaa ctccgccgcc ggttccattg ggtctccgga tctgtcctct gctttgtacg 600 
gggtcgatgg gtggggagct ccttatttct ccgttaactc taacggagat atctccgtcc 660 
gaccacatgg tacggacaca ctcccccacc aggaaattga ccttctcaag gtcgtgaaaa 72 0 
aggcctccga cccgaaaaat tcaggggggc tcgggcttca gctgcctctt gttgttcgct 780 
tccctgatgt gctaaaaaac cggttggaat ctctgcaatc ggcttttgat ctcgctgttc 840 
attcccaggg ctatggggcc cactaccaag gtgtttatcc cgtgaaatgc aatcaagaca 900 
ggttcgtggt ggaagatatt gtcaaattcg ggtcgtcatt ccggttcggg ttggaagctg 960 
ggtctaaacc cgagctcctg ttagccatga gctgtctctg caggggcagt gctgagggcc 1020 
ttctcgtttg caatggtttc aaggacgctg agtacatttc gcttgctttg gttgcaagaa 1080 
agctcatgtt aaacactgta attgttcttg aacaagagga ggagcttgac cttgtgattg 1140 
atataagccg taagatggct gttcggcccg taattggact tcgggctaag ctcaggacca 1200 
agcattcagg ccattttgga tccacttctg gagaaaaagg taagtttggg cttacaacga 1260 
cccaaattgt tcgtgtagtg aagaagctgg aagaatccgg aatgctggat tgccttcagt 1320 
tgctgcattt tcacattgga tctcagatcc cttcaacggc gttgcttgct gatggtgttg 1380 
gtgaggctgc tcagatttat tgtgaattaa tccgtcttgg tgcgggtatg aagttcattg 1440 
atactggagg tgggctcgga attgattatg atggtactaa atcatgtgat tcagatgtct 1500 
ctgttggcta tggcattcaa gaatacgcct ccacagttgt ccaggcggtt caatatgttt 1560 
gcgaccgtaa gggcgtgaag cacccagtga tttgcagcga aagtggcagg gcaattgttt 1620 
ctcatcactc aattctgatt ttcgaagccg tgtctgcttc tagtcactca tgttcttctt 168 0 
cacatctgtc ttctggtggc ctccaatcca tggcggagac gctcaatgaa gatgcccttg 1740 
ctgattaccg caatttatct gctgctgcag ttcgtggaga gtacgagacg tgtgtacttt 1800 
actctgatca gttgaaacag agatgtgtgg atcagtttaa agaagggtcc ttgggtattg 1860 
aacatcttgc tgctgttgat agcatctgtg attttgtatc aaaggctatg ggggctgctg 1920 
atcctatccg cacttaccat gtgaatctgt caattttcac ttcaattcct gatttttggg 1980 
cctttggtca attgtttccg attgttccaa tacaccgttt agatgaaaag cctgcagtaa 2040 
ggggaatatt atcggacttg acttgtgaca gtgatgggaa ggttgataag ttcattggtg 2100 
gcgaatcaag cttgcagctg catgaattgg gaagtaatgg cgatggtggt gggtattatc 2160 
tggggatgtt tttgggtggg gcttatgagg aggcgctcgg aggactccac aacctgtttg 2220 
gtggaccaag cgtggtgcgc gtggtgcaga gcgatagcgc tcacagcttc gccatgtctc 2280 
gctccgtccc tggcccgtcc tgcgcggacg tgctccgagc gatgcagcac gagcccgagc 2340 
tcatgttcga gactctcaag caccgtgcgg aggaattctt ggaacaagaa gaagacaaag 2400 
ggctggccat tgcatctttg gccagcagct tagctcagtc cttccataac atgccttacc 2460 
ttgtggcgcc tgcatcttgc tgcttcactg cagttactgc taacaacggt ggctataact 2520 
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actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag atttggtcct 258 0 
attgcactgc ttgaagtgtt gtcgtagcat ctccagtttt agtttgtcgt cgaagttgtc 2640 
tgtttttgaa taataccctt agttggtgat gtttttctaa aaaaaaaaaa aaaaa 2695 

<210> 24 
<211> 720 
<212> PRT 
<213> Plant 

<400> 24 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu 
15 10 15 

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe 
20 25 30 

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser lie Gly Ser 
35 40 45 

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro 
50 55 60 

Tyr Phe Ser Val Asn Ser Asn Gly Asp lie Ser Val Arg Pro His Gly 
65 70 75 80 

Thr Asp Thr Leu Pro His Gin Glu lie Asp Leu Leu Lys Val Val Lys 
85 90 95 

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gin Leu Pro 
100 105 110 

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu 
115 120 125 

Gin Ser Ala Phe Asp Leu Ala Val His Ser Gin Gly Tyr Gly Ala His 
130 135 140 

Tyr Gin Gly Val Tyr Pro Val Lys Cys Asn Gin Asp Arg Phe Val Val 
145 150 155 160 

Glu Asp lie Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala 
165 170 175 

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Arg Gly 
180 185 190 

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr 
195 200 205 
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lie Ser Leu Ala Leu 
210 

Val Leu Glu Gin Glu 
225 

Lys Met Ala Val Arg 
245 

Lys His Ser Gly His 
260 

Gly Leu Thr Thr Thr 
275 

Ser Gly Met Leu Asp 
290 

Gin lie Pro Ser Thr 
305 

Gin lie Tyr Cys Glu 
325 

Asp Thr Gly Gly Gly 
340 

Asp Ser Asp Val Ser 
355 

Val Val Gin Ala Val 
370 

Pro Val lie Cys Ser 
385 

lie Leu lie Phe Glu 
405 

Ser His Leu Ser Ser 
420 

Glu Asp Ala Leu Ala 
435 

Gly Glu Tyr Glu Thr 
450 



Val Ala Arg Lys Leu Met 
215 

Glu Glu Leu Asp Leu Val 

230 235 

Pro Val lie Gly Leu Arg 
250 

Phe Gly Ser Thr Ser Gly 
265 

Gin lie Val Arg Val Val 
280 

Cys Leu Gin Leu Leu His 
295 

Ala Leu Leu Ala Asp Gly 
310 315 

Leu lie Arg Leu Gly Ala 
330 

Leu Gly lie Asp Tyr Asp 
345 

Val Gly Tyr Gly lie Gin 
360 

Gin Tyr Val Cys Asp Arg 
375 

Glu Ser Gly Arg Ala lie 
390 395 

Ala Val Ser Ala Ser Ser 
410 

Gly Gly Leu Gin Ser Met 
425 

Asp Tyr Arg Asn Leu Ser 
440 

Cys Val Leu Tyr Ser Asp 
455 



Leu Asn Thr Val lie 
220 

lie Asp lie Ser Arg 
240 

Ala Lys Leu Arg Thr 
255 

Glu Lys Gly Lys Phe 
270 

Lys Lys Leu Glu Glu 
285 

Phe His lie Gly Ser 
300 

Val Gly Glu Ala Ala 
320 

Gly Met Lys Phe lie 
335 

Gly Thr Lys Ser Cys 
350 

Glu Tyr Ala Ser Thr 
365 

Lys Gly Val Lys His 
380 

Val Ser His His Ser 
400 

His Ser Cys Ser Ser 
415 

Ala Glu Thr Leu Asn 

430 

Ala Ala Ala Val Arg 
445 

Gin Leu Lys Gin Arg 
460 
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Cys Val Asp Gin Phe Lys Glu Gly Ser Leu Gly lie Glu His Leu Ala 
465 470 475 480 

Ala Val Asp Ser lie Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala 
485 490 495 

Asp Pro lie Arg Thr Tyr His Val Asn Leu Ser lie Phe Thr Ser lie 
500 505 510 

Pro Asp Phe Trp Ala Phe Gly Gin Leu Phe Pro lie Val Pro lie His 
515 520 525 

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly lie Leu Ser Asp Leu Thr 
530 535 540 

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe lie Gly Gly Glu Ser Ser 
545 550 555 560 

Leu Gin Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr 
565 570 575 

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu 
580 585 590 

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gin Ser Asp 
595 600 605 

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys 
610 615 620 

Ala Asp Val Leu Arg Ala Met Gin His Glu Pro Glu Leu Met Phe Glu 
625 630 635 640 

Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gin Glu Glu Asp Lys 
645 650 655 

Gly Leu Ala lie Ala Ser Leu Ala Ser Ser Leu Ala Gin Ser Phe His 
660 665 670 

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val 
675 680 685 

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala 
690 695 700 

Ala Asp Ser Ala Thr Gly Glu Asp Glu lie Trp Ser Tyr Cys Thr Ala 
705 710 715 720 
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<210> 25 
<211> 914 
<212> DNA 
<213> Plant 



<400> 25 

aagctcgann ttaancctca ntaaagggaa 
gtcgacggta tcgataagct tgattaagct 
ttttaggcgc ggcctattcc ctttggctat 
gatttcctcc ataaattctc cgatctaaat 
ttgttggagt tgtttggatg ggtgtttacc 
tccgtaagta acttaagtgc aacatggaaa 
aagcttaatc gaattcctgc agcccggggg 
ggtggagctc caattcgccc tatagtgagt 
caacgtcgtg actgggaaan gcctggcgtt 
ctttcgccag ctgggcgtaa tagcgaanag 
acctgatggn gaatgggacn gccctgtanc 
ccccancgtg acgcnaactt gcaacgccta 
aagttcncgg ctttccgtaa gtccaagcgg 
ccccnaaact gttanggtnt gtgatttggc 
ttaaangcct ntcaacngaa accaccatcg 
nngtaaatta nntn 



caaaagctgg taccgnggcc ccccctcgag 60 
tagtangcac attagcagcg cttgggatga 120 
ataatcgtgt ggttctggga attaaaaccc 180 
ggcanagaat tttcatattt ataccttttc 240 
ccaaagtgtt cctgggactg catgcataca 300 
atttcattga gaggaatcag caaaaaaaaa 360 
atccactagt tctagagcgg ccgccaccgc 420 
cgtattacaa ttcactgggc cgtcgtttta 480 
accaacttaa tcgccttgca gcacatcccc 540 
gccgcacgat cgccttccca acagttgcgc 600 
ngcgcangaa ncgcggcngg tgtggtggta 660 
acgccgcnct tcgcttctcc ttcttctngc 720 
gggnccttag gttcgattat gttnggaccc 780 
accccaaaac gtttcccttg nctggtcntt 840 
gnacttgtta aggntncctt gcntgnaaaa 900 

914 



<210> 26 
<211> 829 
<212> DNA 
<213> Plant 



<400> 26 

agcaagctcg atatcgccct cactaaaggg aacaaaaact ggtaccgggc cccccctcga 60 
ggtcgacggt atcgataagc ttgattaanc tttttttttt tgaactacac aagggaattt 12 0 
cttctcctna gtaacacatg agaataatta gtgcaataaa ttacaagagg aacattgcag 180 
ttggatttaa gaatctgcgc tggggaattt agcctcaata tttgctacaa ccgtacagat 240 
ttcactgcat tcatgaacga tagtatccgt gacacatcct tttggatgcc gtcctgtcca 300 
catatgccac tactcacatc cactccattg ggtttaagtt gcagaaagag cttcacaaac 360 
attctccggg ttaattcctc ctgccaagag ccacccatgt ttgcttctaa tgcgcggcag 42 0 
cttaaactga acccagttga atcctttgcc actgccacct tttgcactat ccactagaac 480 
ccaatcaacc agagaagact cctcatcaga aatatagttc aaaaggctcc tcttcatttg 540 
catgaagtac gtatattaca cgtttttccc tgaccaaagc ttaatcgaat tctgcagccc 600 
gggggatcng gnattctaga gcggcgccac gcggtggagc tccaatcgcc taaatgancn 660 
ataaaatcac tggccgtcgt ttanacgncn ggacgggaaa cctgggtacc aacttaatcg 720 
cctgnagcna tccccttcnc agcggngtan acgaaaggcc gncgattgcc tccanattgc 780 
cacnggatgg aanggacncc gtncgganga acngggggnn ggggtaccn 82 9 
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